In this first part of the book we shall study aspects of classical statistical physics that 
every physicist should know but that are not usually treated in elementary thermodynamics 
courses. Our study will lay the microphysical (particle-scale) foundations for the continuum 
physics of Parts II-VI. As a central feature of our approach, we shall emphasize the intimate 
connections between the relativistic formulation of statistical physics and its nonrelativistic 
limit, and between quantum statistical physics and the classical theory. Throughout, we 
shall presume that the reader is familiar with elementary thermodynamics, but not with 
other aspects of statistical physics. 

In Chap. 2 we will study kinetic theory, the simplest of all formalisms for analyzing 
systems of huge numbers of particles (e.g., molecules of air, or neutrons diffusing through 
a nuclear reactor, or photons produced in the big-bang origin of the Universe). In kinetic 
theory the key concept is the "distribution function" or "number density of particles in phase 
space", $\mathcal{N}$; i.e., the number of particles per unit 3-dimensional volume of ordinary space and 
per unit 3-dimensional volume of momentum space. Despite first appearances, $\mathcal{N}$ turns out 
to be a geometric, frame-independent entity. This $\mathcal{N}$ and the frame-independent laws it 
obeys provide us with a means for computing, from microphysics, the macroscopic quantities 
that appear in continuum physics: mass density, thermal energy density, pressure, equations of state, 
thermal and electrical conductivities, viscosities, diffusion coefficients, and so on. 

In Chap. 3 we will develop the foundations of statistical mechanics. Here our statistical 
study will be more sophisticated than in kinetic theory: We shall deal with "ensembles" of 
physical systems. Each ensemble is a (conceptual) collection of a huge number of physical 
systems that are identical in the sense that they have the same degrees of freedom, but 
different in that their degrees of freedom may be in different states. For example, the 
systems in an ensemble might be balloons that are each filled with $10^{23}$ air molecules so each 
is describable by $3 \times 10^{23}$ coordinates (the $x, y, z$ of all the molecules) and $3 \times 10^{23}$ momenta 
(the $p_x, p_y, p_z$ of all the molecules). The state of one of the balloons is fully described, 
then, by $6 \times 10^{23}$ numbers. We introduce a distribution function $\mathcal{N}$ which is a function of 
these $6 \times 10^{23}$ different coordinates and momenta, i.e., it is defined in a phase space with 
$6 \times 10^{23}$ dimensions. This distribution function tells us how many systems (balloons) in 
our ensemble lie in a unit volume of that phase space. Using this distribution function 
we will study such issues as the statistical meaning of entropy, the relationship between 
entropy and information, the statistical origin of the second law of thermodynamics, the 
statistical meaning of "thermodynamic equilibrium", and the evolution of ensembles into 
thermodynamic equilibrium. Among our applications will be a derivation of the Fermi-Dirac 
distribution for fermions in thermodynamic equilibrium and the Bose-Einstein distribution 
for bosons, a development of the theory of Bose-Einstein condensation in a dilute gas, and an 
exploration of the meaning and role of entropy in gases, in black holes and in the universe 
as a whole. 

In Chap. 4 we will use the tools of statistical mechanics to study statistical thermodynamics, 
i.e. ensembles of systems that are in or near thermodynamic equilibrium (also called 
statistical equilibrium). Using statistical mechanics we shall derive the laws of thermodynamics, 
and we shall learn how to use thermodynamic and statistical mechanical tools, hand 
in hand, to study not only equilibria, but also the probabilities for random, spontaneous 
fluctuations away from equilibrium. Among the applications of these tools that we shall study 
are: (i) chemical and particle reactions, such as ionization equilibrium in a hot gas, and 
electron-positron pair formation in a still hotter gas; and (ii) phase transitions, such as the 
freezing, melting, vaporization and condensation of water. We shall focus special attention 
on a ferromagnetic phase transition in which the magnetic moments of atoms spontaneously 
align with each other as iron is cooled, using it to illustrate two elegant and powerful 
techniques of statistical physics: the renormalization group, and Monte Carlo methods. 

In Chap. 5 we will develop the theory of random processes (a modern, mathematical 
aspect of which is the theory of stochastic differential equations). Here we shall study 
the dynamical evolution of processes that are influenced by a huge number of factors over 
which we have little control and little knowledge, except their statistical properties. One 
example is the "Brownian motion" of a dust particle being buffeted by air molecules; another 
is the motion of a pendulum used, e.g., in a gravitational-wave interferometer, when one 
monitors that motion so accurately that one can see the influences of seismic vibrations and 
of fluctuating "thermal" ("Nyquist") forces in the pendulum's suspension wire. The position 
of such a dust particle or pendulum cannot be predicted as a function of time, but one can 
compute the probability that it will evolve in a given manner. The theory of random processes 
is a theory of the evolution of the position's probability distribution (and the probability 
distribution for any other entity driven by random, fluctuating influences). Among the 
random-process concepts we shall study are spectral densities, correlation functions, the 
Fokker-Planck equation, which governs the evolution of probability distributions, and the 
fluctuation-dissipation theorem, which says that, associated with any kind of friction, there 
must be fluctuating forces whose statistical properties are determined by the strength of the 
friction and the temperature of the entities that produce the friction. 

The theory of random processes, as treated in Chapter 5, also includes the theory of 
signals and noise. At first sight this undeniably important topic, which lies at the heart of 
experimental and observational science, might seem outside the scope of this book. However, 
we shall discover that it is intimately connected to statistical physics and that similar 
principles to those originally used to describe, say, Brownian motion are appropriate when 
thinking about, for example, how to detect the electronic signal of a rare particle event 
against a strong and random background. We shall study techniques for extracting weak 
signals from noisy data by filtering of the data, and the limits that noise places on the 
accuracies of physics experiments and on the reliability of communications channels. 



Chapter 2 
Kinetic Theory 



Version 0402.2.K, 12 October 2004 

[Changes from 0402.1.K: (i) Some rewording of parts (a) and (b) of Ex. 2.14 on neutron 
diffusion in a nuclear reactor. (ii) Minor typos fixed.] 

2.1 Overview 

In this chapter we shall study kinetic theory, the simplest of all branches of statistical physics. 
Kinetic theory deals with the statistical distribution of a "gas" made from a huge number 
of "particles" that travel freely, without collisions, for distances (mean free paths) long 
compared to their sizes. The particles might be, for example, galaxies; and our goal might 
be to study how galaxies, born randomly in the early universe, congregate into clusters as 
the universe expands. Or, the particles might be stars that reside in our own galaxy, and 
our goal might be to study how spiral structure develops in the distribution of our galaxy's 
stars. Or, the particles might be molecules in interstellar space, and our goal might be 
to study how a supernova explosion affects the evolution of the interstellar gas's density 
and temperature distribution. Or, the particles might be photons in the cosmic microwave 
radiation, and our goal might be to study how anisotropies in the expansion of the universe 
affect the photons' temperature distribution. Or, the particles might be electrons in a metal, 
and our goal might be to study how changes of the metal's temperature affect its thermal 
and electrical conductivity. Or, the particles might be neutrons in a nuclear reactor, and our 
goal might be to study whether they can survive long enough to maintain a nuclear chain 
reaction and keep the reactor hot. 

For all such problems, a powerful conceptual tool is the "distribution function" or "number 
density of particles in phase space". In order to develop this concept in a manner valid 
both for photons, which move with the speed of light, and for galaxies, which move slowly 
compared to light, we shall use the tools of special relativity. We begin in Sec. 2.2 by 
introducing the concepts of momentum space, phase space, and the distribution function. In 
Sec. 2.3 we study the distribution functions that characterize systems of particles in thermal 
equilibrium. There are three such equilibrium distributions: one for quantum mechanical 
particles with half-integral spin (fermions), another for quantum mechanical particles with 
integral spin (bosons), and a third for classical particles. As special applications we derive 
the Maxwell-Boltzmann velocity distribution for low-speed, classical particles (Ex. 2.3), and 
we compute the effects of an observer's motion on his measurement of the cosmic microwave 
radiation created in the big-bang origin of the universe (Ex. 2.4). In Sec. 2.4 we meet the 
number-flux four-vector and the stress-energy tensor associated with a collection of particles, 
and we learn how to compute these quantities by integrating over the momentum portion 
of phase space. We show that, if the momentum distribution is isotropic in some reference 
frame, then on macroscopic scales the particles constitute a perfect fluid. We use our 
momentum-space integrals to evaluate the equations of state of various kinds of perfect fluids: 
a photon gas (Ex. 2.5), a nonrelativistic, classical gas (Ex. 2.6), and an electron-degenerate 
hydrogen gas (Ex. 2.7); and we use our results to discuss the physical nature of matter as 
a function of density and temperature. In Sec. 2.5 we study the evolution of the distribution 
function, as described by Liouville's theorem and the associated collisionless Boltzmann 
equation when collisions between particles are unimportant, and by the Boltzmann transport 
equation when collisions are significant. We use a simple variant of these evolution laws 
to study the heating of the Earth by the Sun and the key role played by the "greenhouse 
effect" (Ex. 2.10). Finally, in Sec. 2.6 we learn how to use the Boltzmann transport equation 
to compute the transport coefficients (diffusion coefficient, electrical conductivity, thermal 
conductivity, and viscosity) which describe the diffusive transport of particles, charge, 
energy, and momentum through a gas of particles that collide frequently; and we also use the 
Boltzmann transport equation to study a chain reaction in a nuclear reactor (Ex. 2.14). 



2.2 Phase Space and Distribution Function 
2.2.1 Number density in phase space 

Consider a classical particle with rest mass $m$, moving through spacetime along a world 
line $\mathcal{P}(\zeta)$, or equivalently $\vec{x}(\zeta)$, where $\zeta$ is an affine parameter related to the particle's 
4-momentum $\vec{p}$ by 

$$ \vec{p} = d\vec{x}/d\zeta \tag{2.1a} $$

[Eq. (1.28)]. If the particle has non-zero rest mass, then (as we saw in Chap. 1) the particle's 
4-velocity $\vec{u}$ and the proper time $\tau$ along its world line are related to its 4-momentum and 
affine parameter by 

$$ \vec{p} = m\vec{u} , \qquad \zeta = \tau/m . \tag{2.1b} $$

The particle can be thought of not only as living in four-dimensional spacetime [Fig. 2.1(a)], 
but also as living in a four-dimensional momentum space [Fig. 2.1(b)]. Momentum space, 
like spacetime, is a geometric, coordinate-independent concept: each point in momentum 
space corresponds to a specific 4-momentum $\vec{p}$. The tail of the vector $\vec{p}$ sits at the origin 
of momentum space and its head sits at the point representing $\vec{p}$. The momentum-space 
diagram drawn in Fig. 2.1(b) has as its coordinate axes the components $(p^0, p^1, p^2, p^3)$ of 
the 4-momentum as measured in some arbitrary Lorentz frame. Because the squared length 
of the 4-momentum is always $-m^2$, 

$$ \vec{p}\cdot\vec{p} = -(p^0)^2 + (p^x)^2 + (p^y)^2 + (p^z)^2 = -m^2 , \tag{2.1c} $$

the particle's 4-momentum is confined to a hyperboloid in momentum space. This 
hyperboloid is described mathematically by Eq. (2.1c) and is called the mass hyperboloid. We 
shall often denote the particle's energy $p^0$ by 

$$ \mathcal{E} \equiv p^0 $$

(with the $\mathcal{E}$ in script font to distinguish it from the nonrelativistic energy of a particle). 
In terms of $\mathcal{E}$ and the spatial momentum $\mathbf{p}$, whose components are $(p^x, p^y, p^z)$, the 
mass-hyperboloid relation (2.1c) reads 

$$ \mathcal{E} = \sqrt{m^2 + |\mathbf{p}|^2} . \tag{2.1e} $$

Fig. 2.1: (a) The world line $\vec{x}(\zeta)$ of a particle in spacetime (with one spatial coordinate, $z$, 
suppressed), parametrized by a parameter $\zeta$ that is related to the particle's 4-momentum by 
$\vec{p} = d\vec{x}/d\zeta$. (b) The trajectory of the particle in momentum space. The particle's momentum 
is confined to the mass hyperboloid, $\vec{p}^{\,2} = -m^2$. 

If no forces act on the particle, then its momentum is conserved and its location in 
momentum space remains fixed. However, forces (e.g., due to an electromagnetic field) can 
push the particle's 4-momentum along some curve in momentum space that lies on the mass 
hyperboloid. If we parametrize that curve by the same parameter $\zeta$ as we use in spacetime, 
then the particle's trajectory in momentum space can be written abstractly as $\vec{p}(\zeta)$. Such 
a trajectory is shown in Fig. 2.1(b). Because the mass hyperboloid is three dimensional, we 
can characterize the particle's location on it by just three coordinates rather than four. We 
shall typically use as those coordinates the spatial components of the particle's 4-momentum, 
$(p_x, p_y, p_z)$, or the spatial momentum vector $\mathbf{p}$ as measured in some specific inertial frame. 
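
As a quick numerical illustration of the mass hyperboloid, the following Python sketch places a 4-momentum on the hyperboloid, boosts it along the $x$ direction, and checks that its squared length remains $-m^2$. It is not part of the original text: the function names `energy`, `boost_x` and `squared_length`, the sample momentum, and the boost speed are all our own illustrative assumptions, and units with $c = 1$ are used throughout.

```python
import math

def energy(m, px, py, pz):
    """Energy p^0 of a particle on the mass hyperboloid (units with c = 1)."""
    return math.sqrt(m**2 + px**2 + py**2 + pz**2)

def boost_x(v, p):
    """Components of the 4-momentum p = (E, px, py, pz) in a frame moving with speed v along x."""
    gamma = 1.0 / math.sqrt(1.0 - v**2)
    E, px, py, pz = p
    return (gamma * (E - v * px), gamma * (px - v * E), py, pz)

def squared_length(p):
    """Squared length p.p = -(p^0)^2 + px^2 + py^2 + pz^2."""
    E, px, py, pz = p
    return -E**2 + px**2 + py**2 + pz**2

m = 1.0                                   # hypothetical rest mass
p = (energy(m, 0.3, -1.2, 2.0), 0.3, -1.2, 2.0)
p_boosted = boost_x(0.6, p)

# Both values should agree with -m^2 up to rounding error.
print(squared_length(p), squared_length(p_boosted))
```

Because only three of the four components are independent, the sketch stores the spatial components and recomputes the energy from them whenever a point on the hyperboloid is needed.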

Momentum space and spacetime, taken together, are called phase space. We can regard 
phase space as eight dimensional (four spacetime dimensions plus four momentum-space 
dimensions). Alternatively, if we think of the 4-momentum as confined to the three-dimensional 
mass hyperboloid, then we can regard phase space as seven dimensional. 

Turn attention, now, from an individual particle to a collection of a huge number of 
particles. Assume, for simplicity, that all the particles have the same rest mass $m$ (they are 
identical particles), and allow $m$ to be finite or zero; it does not matter. Examine those 
particles that pass close to a specific event $\mathcal{P}$ in spacetime; and examine them from the 
viewpoint of a specific observer, who lives in a specific inertial reference frame. Fig. 2.1(a) 
is a spacetime diagram drawn in that observer's frame; as seen in that frame, the event $\mathcal{P}$ 
occurs at time $t$ and at spatial location $(x, y, z)$. 

Fig. 2.2: Definition of the distribution function from the viewpoint of a specific observer in a 
specific reference frame: At the event $\mathcal{P}$ the observer selects a 3-volume $d\mathcal{V}_x$, and she focuses on 
the set $\mathcal{S}$ of particles that lie in $d\mathcal{V}_x$ and have momenta lying in a region of the mass hyperboloid 
which is centered on $\vec{p}$ and has 3-momentum volume $d\mathcal{V}_p$. If $dN$ is the number of particles in that 
set $\mathcal{S}$, then $\mathcal{N}(\mathcal{P}, \vec{p}) = dN / d\mathcal{V}_x d\mathcal{V}_p$. 

We ask the observer, at the time $t$ of the chosen event, to select a specific, small, 
3-dimensional box with edge lengths $dx$, $dy$, $dz$, with the event $\mathcal{P}$ at its center, and with 
imaginary (i.e., not physical) walls. She measures this box to have an ordinary, 
three-dimensional volume 

$$ d\mathcal{V}_x = dx\,dy\,dz . \tag{2.2a} $$

Focus attention on a specific box-like region of the mass hyperboloid, centered on a specific 
4-momentum $\vec{p}$ in momentum space and with spatial momenta in the range 
$(p_x \pm \tfrac{1}{2} dp_x,\ p_y \pm \tfrac{1}{2} dp_y,\ p_z \pm \tfrac{1}{2} dp_z)$; Fig. 2.2(b). Our chosen observer attributes 
to that region a 3-dimensional momentum volume 

$$ d\mathcal{V}_p = dp_x\,dp_y\,dp_z . \tag{2.2b} $$

Ask the observer to focus on that collection $\mathcal{S}$ of particles which lie in the spatial 3-volume 
$d\mathcal{V}_x$ [Fig. 2.2(a)] and have spatial momenta in the 3-volume $d\mathcal{V}_p$ [Fig. 2.2(b)]. If there are 
$dN$ particles in this collection $\mathcal{S}$, then the observer will identify 

$$ \mathcal{N} \equiv \frac{dN}{d\mathcal{V}_x\, d\mathcal{V}_p} \tag{2.3} $$

as the number density of particles in phase space. This number density depends on the 
location $\mathcal{P}$ in spacetime of the 3-volume $d\mathcal{V}_x$ and on the 4-momentum $\vec{p}$ about which the 
momentum volume $d\mathcal{V}_p$ is centered. Regarded as a function of these quantities, $\mathcal{N}(\mathcal{P}, \vec{p})$ is 
called the particles' distribution function. It is the fundamental concept of kinetic theory. 
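
The observer's counting prescription can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not a method taken from the text: the function `phase_space_density`, the uniformly distributed test population, and the cell sizes are all hypothetical choices of ours. The sketch counts the particles whose positions fall in a spatial box and whose momenta fall in a momentum box, and divides by the product of the two 3-volumes.

```python
import random

def phase_space_density(particles, x0, p0, dx, dp):
    """Estimate dN / (dV_x dV_p) for cubic cells of edge dx and dp centered on x0 and p0.

    particles: list of (x, y, z, px, py, pz) tuples, all measured in one frame at one time.
    """
    def inside(values, center, edge):
        return all(abs(v - c) <= edge / 2 for v, c in zip(values, center))

    # dN: particles lying in the spatial box AND in the momentum box.
    dN = sum(1 for q in particles if inside(q[:3], x0, dx) and inside(q[3:], p0, dp))
    return dN / (dx**3 * dp**3)

random.seed(0)
# Hypothetical population: uniform in a unit spatial cube and a unit momentum cube,
# so the true density is 200000 particles per unit phase-space volume.
sample = [tuple(random.random() for _ in range(6)) for _ in range(200_000)]
N_est = phase_space_density(sample, (0.5,) * 3, (0.5,) * 3, dx=0.5, dp=0.5)
print(N_est)  # should lie near 200000, up to counting noise
```

The estimate fluctuates by roughly $1/\sqrt{dN}$, which is why the cells must be small compared with the scales on which the distribution varies yet large enough to contain many particles.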

At first sight one might expect $\mathcal{N}$ to depend also on the inertial reference frame used in its 
definition, i.e., on the velocity of the observer. If this were the case, i.e., if $\mathcal{N}$ at fixed $\mathcal{P}$ and 
$\vec{p}$ were different when computed by the above prescription using different inertial frames, 
then we would feel compelled to seek some other frame-independent geometric object to 
serve as our foundation for kinetic theory. This is because the principle of relativity insists 
that all fundamental physical laws should be expressible in frame-independent language. 

Fortunately, the distribution function (2.3) is frame-independent by itself, i.e. it is a 
frame-independent scalar field in phase space; so we need seek no further for a geometric 
foundation for kinetic theory. To prove the frame-independence of $\mathcal{N}$, we shall consider, 
first, the frame dependence of the spatial 3-volume $d\mathcal{V}_x$, then the frame dependence of the 
momentum 3-volume $d\mathcal{V}_p$, and finally the frame dependence of their product $d\mathcal{V}_x d\mathcal{V}_p$ and of 
the distribution function $\mathcal{N} = dN / d\mathcal{V}_x d\mathcal{V}_p$. In studying this frame dependence the thing that 
identifies the 3-volume $d\mathcal{V}_x$ and momentum 3-volume $d\mathcal{V}_p$ is the set of particles $\mathcal{S}$. We shall select 
that set once and for all and hold it fixed, and correspondingly the number of particles $dN$ 
in the set will be fixed. Moreover, we shall assume that the particles' rest mass $m$ is nonzero 
and shall deal with the zero-rest-mass case later by taking the limit $m \to 0$. Then there is a 
preferred frame from which to observe the particles: their own rest frame. We shall denote 
by $x^{\alpha'}$ the Lorentz coordinates of that rest frame, and shall retain our previous notation $x^{\alpha}$ 
for the Lorentz coordinates of our previous frame. We shall orient the coordinate axes of 
the two frames so they are related by a pure boost in the $x$ direction with speed $v$ and with 
$\gamma = 1/\sqrt{1 - v^2}$; see Fig. 2.3. 




Fig. 2.3: (a) Spacetime diagram drawn from viewpoint of the (primed) rest frame of the particles 
$\mathcal{S}$. The length of the box occupied by the particles, $dx$, as measured in the moving (unprimed) 
frame, is Lorentz contracted, so $dx = \gamma^{-1} dx'$ and $d\mathcal{V}_x = \gamma^{-1} d\mathcal{V}_{x'}$. (b) Momentum space diagram 
drawn from viewpoint of the unprimed observer. The ranges of $x$-components of momenta occupied 
by the particles $\mathcal{S}$ are related by $dp_{x'} = \gamma^{-1} dp_x$, so that $d\mathcal{V}_{p'} = \gamma^{-1} d\mathcal{V}_p$. 

Figure 2.3(a) is a spacetime diagram that shows the world lines of a few of the particles 
in the set $\mathcal{S}$. These world lines go straight upward because the diagram's chosen reference 
frame is the particles' (primed) rest frame. The leftmost and rightmost world lines demark 
the walls of the particles' box. The box's length $dx'$ as measured in the particles' rest 
frame is the square root of the spacetime interval along the thick horizontal line in the 
figure. Its length $dx$ as measured in the unprimed reference frame is the square root of the 
spacetime interval along the thick slanted line. The slanted line $dx$ is the hypotenuse of a 
right triangle in spacetime with spatial leg $dx'$ and temporal leg $v\,dx'$. By the spacetime 
"Pythagorean theorem" (equation for the interval) applied to this triangle, we have 
$dx^2 = dx'^2 - (v\,dx')^2 = (1 - v^2)\,dx'^2 = \gamma^{-2} dx'^2$, so $dx = \gamma^{-1} dx'$, which is the standard formula 
for the Lorentz contraction of a moving box along its direction of motion. Since lengths 
are preserved along the perpendicular directions, $dy = dy'$, $dz = dz'$, we conclude that 
$d\mathcal{V}_x = dx\,dy\,dz = \gamma^{-1} dx'\,dy'\,dz' = \gamma^{-1} d\mathcal{V}_{x'}$. Since the $dN$ particles all have nearly the same 
energy, $\mathcal{E} = m\gamma$, as seen in the unprimed frame, this Lorentz-contraction relation can be 
rewritten as $\mathcal{E}\, d\mathcal{V}_x = m\, d\mathcal{V}_{x'}$, which says that 

$$ \mathcal{E}\, d\mathcal{V}_x = \text{(a frame-independent quantity)} . \tag{2.4a} $$

Thus, the spatial volume occupied by the particles is not frame-independent, but the product 
of their volume and their energy is. 

Fig. 2.3(b) shows the mass hyperboloid, the region on that hyperboloid occupied by 
the momenta of the particles $\mathcal{S}$, and the projections $dp_x$ and $dp_{x'}$ of that box on the two 
frames' axes. We have chosen to draw this diagram in the unprimed frame, rather than the 
particles' primed frame, because we thereby obtain a convenient right triangle to use in our 
computation. The hypotenuse of the right triangle is $dp_{x'}$ (thick slanted curve), the horizontal 
leg is $dp_x$ (thick dashed curve), and the vertical leg is $v\,dp_x$; so the spacetime "Pythagorean 
theorem" says $(dp_{x'})^2 = (dp_x)^2 - (v\,dp_x)^2 = \gamma^{-2} (dp_x)^2$. Therefore, $dp_{x'} = \gamma^{-1} dp_x$, which is a 
reversal from the relation $dx = \gamma^{-1} dx'$. Correspondingly, there is a reversal from Eq. (2.4a): 

$$ \frac{d\mathcal{V}_p}{\mathcal{E}} = \text{(a frame-independent quantity)} . \tag{2.4b} $$

This result can also be understood by examining the little box on the mass hyperboloid 
in Fig. 2.2(b). It is a small element of 3-volume $d\vec{\Sigma}_p$ in 4-momentum space [cf. Eq. (1.121) 
and associated discussion]. Regarded as a 4-vector in momentum space, this 3-volume points 
perpendicular to the little box in the diagram, i.e. it points in the same direction as the 
4-momentum $\vec{p}$ of a particle at the center of the 3-volume; i.e., $d\vec{\Sigma}_p$ and $\vec{p}$ are parallel to each 
other. The time component of $\vec{p}$ is the particle energy $\mathcal{E}$, and the time component of the 
3-volume $d\vec{\Sigma}_p$ is $d\Sigma_p^0 = d\mathcal{V}_p$ (the projection of the box into the horizontal plane). Because 
$d\vec{\Sigma}_p$ and $\vec{p}$ are parallel, when we transform from one reference frame to another, their time 
components must grow or shrink in unison, i.e. the ratio $d\Sigma_p^0 / p^0 = d\mathcal{V}_p / \mathcal{E}$ must not change; 
it is frame invariant. 

By taking the product of Eqs. (2.4a) and (2.4b) we see that for our chosen set of particles 
$\mathcal{S}$, 

$$ d\mathcal{V}_x\, d\mathcal{V}_p = \text{(a frame-independent quantity)} ; \tag{2.4c} $$

and since the number of particles in the set, $dN$, is obviously frame-independent, we conclude 
that 

$$ \mathcal{N} = \frac{dN}{d\mathcal{V}_x\, d\mathcal{V}_p} \equiv \frac{dN}{d\mathcal{V}^2} = \text{(a frame-independent quantity)} . \tag{2.5} $$

Here we have introduced the notation $d\mathcal{V}^2 = d\mathcal{V}_x d\mathcal{V}_p$ for the frame-independent six-dimensional 
phase-space volume occupied by the particles, a notation that suppresses any reference to 
the individually frame-dependent $d\mathcal{V}_x$ and $d\mathcal{V}_p$. 

Fig. 2.4: Geometric construction used in defining the "specific intensity" $I_\nu$. 

Although we assumed nonzero rest mass, $m \neq 0$, in our derivation, the conclusions that 
$\mathcal{E}\, d\mathcal{V}_x$ and $d\mathcal{V}_p / \mathcal{E}$ are frame-independent continue to hold if we take the limit $m \to 0$ and 
the 4-momenta become null. Correspondingly, all of Eqs. (2.4a)-(2.5) are valid for particles 
with zero rest mass as well as nonzero. 
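
The frame-independence of the product of spatial and momentum volumes can be checked numerically. The Python sketch below is an illustration under our own assumptions, not the derivation used in the text: the helper names, the sample momenta, and the finite-difference step `eps` are hypothetical. It obtains the change in spatial volume from the Lorentz contraction of a box comoving with the particles, obtains the change in momentum volume from a finite-difference derivative of the boosted $p_x$ along the mass hyperboloid (with $p_y$ and $p_z$ unchanged by a boost along $x$), and multiplies the two factors.

```python
import math

def energy(m, px, py, pz):
    """Energy on the mass hyperboloid (c = 1)."""
    return math.sqrt(m**2 + px**2 + py**2 + pz**2)

def boost_x(v, E, px):
    """Lorentz boost with speed v along x: returns (E', px')."""
    gamma = 1.0 / math.sqrt(1.0 - v**2)
    return gamma * (E - v * px), gamma * (px - v * E)

def contraction(m, px, py, pz):
    """Factor 1/gamma by which a box comoving with the particles is Lorentz contracted."""
    E = energy(m, px, py, pz)
    speed_sq = (px**2 + py**2 + pz**2) / E**2
    return math.sqrt(1.0 - speed_sq)

def phase_volume_ratio(m, px, py, pz, v, eps=1e-6):
    """Ratio (dV_x' dV_p') / (dV_x dV_p) for a boost with speed v along x."""
    _, px_new = boost_x(v, energy(m, px, py, pz), px)
    # Spatial volumes scale like the contraction factors seen in the two frames.
    spatial = contraction(m, px_new, py, pz) / contraction(m, px, py, pz)

    # Momentum volumes scale like d(px')/d(px) at fixed py, pz on the hyperboloid.
    def px_prime(q):
        return boost_x(v, energy(m, q, py, pz), q)[1]

    momentum = (px_prime(px + eps) - px_prime(px - eps)) / (2.0 * eps)
    return spatial * momentum

ratio = phase_volume_ratio(m=1.0, px=0.4, py=-0.7, pz=1.1, v=0.8)
print(ratio)  # should be very close to 1
```

Individually, the spatial factor equals $\mathcal{E}/\mathcal{E}'$ and the momentum factor equals $\mathcal{E}'/\mathcal{E}$, in accord with Eqs. (2.4a) and (2.4b); only their product is unity.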

The normalization that one uses for the distribution function is arbitrary; renormalize 
$\mathcal{N}$ by multiplying with any constant, and $\mathcal{N}$ will still be a geometric, frame-independent 
quantity and will still contain the same information as before. In this book, we shall use 
several renormalized versions of $\mathcal{N}$, depending on the situation. We shall now introduce 
them: 

2.2.2 The distribution function f(t, x, v) for particles in a plasma. 

In Part V, when dealing with nonrelativistic plasmas (collections of electrons and ions that 
have speeds small compared to light), we shall regard the distribution function as a function 
of time t, location x in Euclidean space, and velocity v (instead of momentum), and we shall 
use the notation and normalization 

$$ f(t, \mathbf{x}, \mathbf{v}) \equiv \frac{dN}{dx\,dy\,dz\,dv_x\,dv_y\,dv_z} = m^3 \mathcal{N} . \tag{2.6} $$

However, we shall not use this normalization in the present chapter. 

2.2.3 The distribution function $I_\nu/\nu^3$ for photons. 

When dealing with photons or other zero-rest-mass particles, one often reexpresses $\mathcal{N}$ in 
terms of the specific intensity, $I_\nu$. This quantity is defined as follows (cf. Fig. 2.4): An 
observer places a CCD (or other measuring device) perpendicular to the spatial direction $\mathbf{n}$ 
of propagation of the photons (perpendicular as measured in her Lorentz frame). The region 
of the CCD that the photons hit has surface area $dA$ as measured by her, and because the 
photons move at the speed of light $c = 1$, the product of that surface area with the time $dt$ 
that they take to all go through the CCD is equal to the volume they occupy at a specific 
moment of time: 

$$ d\mathcal{V}_x = dA\,dt . \tag{2.7a} $$

The photons all have nearly the same frequency $\nu$ as measured by the observer, and their 
4-momenta as measured by her have components related to that frequency and to their 
propagation direction $\mathbf{n}$ by 

$$ \mathcal{E} = p^0 = h\nu , \qquad \mathbf{p} = h\nu\,\mathbf{n} , \tag{2.7b} $$

where $h$ is Planck's constant. Their frequencies lie in a range $d\nu$ centered on $\nu$, and they 
come from a small solid angle $d\Omega$ centered on $-\mathbf{n}$; and the volume they occupy in momentum 
space is related to these by 

$$ d\mathcal{V}_p = |\mathbf{p}|^2\, d\Omega\, d|\mathbf{p}| = h^3 \nu^2\, d\Omega\, d\nu . \tag{2.7c} $$

The photons' specific intensity, as measured by the observer, is defined to be the total energy, 

$$ d\mathcal{E} = h\nu\, dN , \tag{2.7d} $$

that crosses the CCD per unit area $dA$, per unit time $dt$, per unit frequency $d\nu$, and per 
unit solid angle $d\Omega$ (i.e., "per unit everything"): 

$$ I_\nu \equiv \frac{d\mathcal{E}}{dA\,dt\,d\nu\,d\Omega} . \tag{2.8} $$

(This $I_\nu$ is sometimes denoted $I_{\nu\Omega}$.) From Eqs. (2.5), (2.7) and (2.8) we readily deduce the 
following relationship between this specific intensity and the distribution function: 

$$ \mathcal{N} = \frac{c^2}{h^4} \frac{I_\nu}{\nu^3} . \tag{2.9} $$

Here the factor $c^2$ has been inserted so that $I_\nu$ is expressed in ordinary (cgs or mks) units 
in accord with astronomers' conventions. This relation shows that, with an appropriate 
renormalization, $I_\nu/\nu^3$ is the photons' distribution function. 
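
To make the conversion between the astronomers' and the physicists' descriptions concrete, here is a short Python sketch of the relation $\mathcal{N} = (c^2/h^4)\, I_\nu/\nu^3$ in SI units. The function names and the choice of test input are our own assumptions: as a familiar example we feed in a blackbody (Planck) specific intensity, which is not derived in this section, at a temperature of roughly that of the cosmic microwave radiation.

```python
import math

H = 6.62607015e-34    # Planck constant, J s
C = 2.99792458e8      # speed of light, m/s
K_B = 1.380649e-23    # Boltzmann constant, J/K

def distribution_from_intensity(I_nu, nu):
    """Distribution function N = (c^2 / h^4) I_nu / nu^3 in SI units."""
    return C**2 * I_nu / (H**4 * nu**3)

def planck_intensity(nu, T):
    """Blackbody specific intensity; an assumed test input, not taken from this section."""
    return 2.0 * H * nu**3 / C**2 / math.expm1(H * nu / (K_B * T))

T = 2.725             # K, hypothetical choice close to the microwave-background temperature
for nu in (30e9, 160e9, 600e9):
    N = distribution_from_intensity(planck_intensity(nu, T), nu)
    # N * h^3 is dimensionless; for a blackbody it reduces to 2 / (exp(h nu / k T) - 1).
    print(nu, N * H**3)
```

For this input the factors of $\nu^3$ cancel, so the resulting $\mathcal{N}$ depends on frequency only through the ratio $h\nu/k_B T$.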

Astronomers and opticians regard the specific intensity (or equally well $I_\nu/\nu^3$) as a 
function of the photon propagation direction $\mathbf{n}$, photon frequency $\nu$, location $\mathbf{x}$ in space, and 
time $t$. By contrast, physicists regard the distribution function $\mathcal{N}$ as a function of the 
photon 4-momentum $\vec{p}$ (on the mass hyperboloid) and of location $\mathcal{P}$ in spacetime. Clearly, the 
information contained in these two sets of variables, the astronomers' set and the physicists' 
set, is the same. 

If two different astronomers in two different Lorentz frames at the same event in 
spacetime examine the same set of photons, they will measure the photons to have different 
frequencies $\nu$ (because of the Doppler shift between their two frames); and they will measure 
different specific intensities $I_\nu$ (because of Doppler shifts of frequencies, Doppler shifts of 
energies, dilation of times, Lorentz contraction of areas of CCD's, and aberrations of photon 
propagation directions and thence distortions of solid angles); however, if each astronomer 
computes the ratio of the specific intensity that she measures to the cube of the frequency 
she measures, that ratio, according to Eq. (2.9), will be the same as computed by the other 
astronomer; i.e., the distribution function $I_\nu/\nu^3$ will be frame-independent. 



2.2.4 The mean occupation number $\eta$ 



Although this book is about classical physics, we cannot avoid making occasional contact 
with quantum theory. The reason is that classical physics is quantum mechanical in origin. 
Classical physics is an approximation to quantum physics, and not conversely. Classical 
physics is derivable from quantum physics, and not conversely. 

In statistical physics, the classical theory cannot fully shake itself free from its quantum 
roots; it must rely on them in crucial ways that we shall meet in this chapter and the next. 
Therefore, rather than try to free it from its roots, we shall expose the roots and profit 
from them by introducing a quantum mechanically based normalization for the distribution 
function: the "mean occupation number" $\eta$. 

As an aid in defining the mean occupation number, we introduce the concept of the 
density of states: Consider a relativistic particle of mass m, described quantum mechanically. 
Suppose that the particle is known to be located in a volume dV x (as observed in a specific 
Lorentz frame) and to have a spatial momentum in the region dV p centered on p. Suppose, 
further, that the particle does not interact with any other particles or fields. How many 
single-particle quantum mechanical states$^1$ are available to the free particle? This question is 
answered most easily by constructing, in the particle's Lorentz frame, a complete set of wave 
functions for the particle's spatial degrees of freedom, with the wave functions (i) confined 
to be eigenfunctions of the momentum operator, and (ii) confined to satisfy the standard 
periodic boundary conditions on the walls of a box with volume $d\mathcal{V}_x$. For simplicity, let the 
box have edge length $L$ along each of the three spatial axes of the Lorentz coordinates, so 
$d\mathcal{V}_x = L^3$. (This $L$ is arbitrary and will drop out of our analysis shortly.) Then a complete 
set of wave functions satisfying (i) and (ii) is the set $\{\psi_{j,k,l}\}$ with 

$$ \psi_{j,k,l}(x, y, z) = \frac{1}{L^{3/2}}\, e^{i(2\pi/L)(jx + ky + lz)}\, e^{-i\omega t} , \tag{2.10a} $$

where 

$$ \omega = \sqrt{(m/\hbar)^2 + (2\pi/L)^2 (j^2 + k^2 + l^2)} . \tag{2.10b} $$

Here the demand that the wave function take on the same values at the left and right faces 
of the box ($x = -L/2$ and $x = +L/2$), and at the front and back faces, and at the top 
and bottom faces (the demand for periodic boundary conditions) dictates that the quantum 
numbers $j$, $k$, and $l$ be integers. Note that $\omega$ is a relativistic frequency, i.e. it includes the 
influence of the particle's rest mass, i.e. it is related to the particle's energy by $\mathcal{E} = \hbar\omega$; 
and correspondingly, Eq. (2.10b) is the relation $\mathcal{E} = \sqrt{m^2 + |\mathbf{p}|^2}$ [Eq. (2.1e)] in disguise. The 
basis states (2.10a) are eigenfunctions of the momentum operator $(\hbar/i)\nabla$ with momentum 
eigenvalues 

$$ p_x = \frac{2\pi\hbar}{L}\, j , \qquad p_y = \frac{2\pi\hbar}{L}\, k , \qquad p_z = \frac{2\pi\hbar}{L}\, l . \tag{2.10c} $$

Thus, the allowed values of the momentum are confined to "lattice sites" in 3-momentum 
space with one site in each cube of side $2\pi\hbar/L$. Correspondingly, the total number of states 
in the region $d\mathcal{V}_x d\mathcal{V}_p$ of phase space is the number of cubes of side $2\pi\hbar/L$ in the region $d\mathcal{V}_p$ 
of momentum space: 

$$ dN_{\rm states} = \frac{d\mathcal{V}_p}{(2\pi\hbar/L)^3} = \frac{L^3\, d\mathcal{V}_p}{(2\pi\hbar)^3} = \frac{d\mathcal{V}_x\, d\mathcal{V}_p}{h^3} . \tag{2.10d} $$



$^1$ A quantum mechanical state for a single particle is called an "orbital" in the chemistry literature and in 
the classic thermal physics textbook by Kittel and Kroemer (1980); we shall use physicists' more conventional 
but cumbersome phrase "single-particle quantum state". 

This is true no matter how relativistic or nonrelativistic the particle may be. 

Thus far we have considered only the particle's spatial degrees of freedom. Particles can 
also have an internal degree of freedom called "spin". For a particle with spin s, the number 
of independent spin states is 

$$ g_s = \begin{cases} 2s + 1 & \text{if } m \neq 0 \text{; e.g., an electron or proton or atomic nucleus,} \\ 2 & \text{if } m = 0 \text{ and } s > 0 \text{; e.g., a photon } (s = 1) \text{ or graviton } (s = 2), \\ 1 & \text{if } m = 0 \text{ and } s = 0 \text{; i.e., a hypothetical massless scalar particle.} \end{cases} \tag{2.11} $$

We shall call this number of internal spin states the particle's multiplicity. 

Taking account both of the particle's spatial degrees of freedom and its spin degree of 
freedom, we conclude that the total number of independent quantum states available in 
the region $d\mathcal{V}_x d\mathcal{V}_p = d\mathcal{V}^2$ of phase space is $dN_{\rm states} = (g_s/h^3)\, d\mathcal{V}^2$, and correspondingly the 
number density of states in phase space is 

$$ \mathcal{N}_{\rm states} = \frac{dN_{\rm states}}{d\mathcal{V}^2} = \frac{g_s}{h^3} . \tag{2.12} $$

Note that, although we derived this number density of states using a specific Lorentz reference 
frame, it is a frame-independent quantity, with a numerical value depending only on Planck's 
constant and (through $g_s$) the particle's rest mass $m$ and spin $s$. 

In quantum statistical mechanics one often focuses attention on the number of particles 
that reside in a given quantum state. That number is called the state's occupation number. 
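Select a specific event $\mathcal{P}$ in spacetime and a specific 4-momentum $\vec{p}$ for particles of a specific
type. Then the ratio of the number density of particles at $(\mathcal{P},\vec{p})$ to the number density of
states is the mean occupation number $\eta$ of the states near $(\mathcal{P},\vec{p})$:

$$\eta(\mathcal{P},\vec{p}) = \frac{\mathcal{N}(\mathcal{P},\vec{p})}{\mathcal{N}_{\rm states}} = \frac{h^3}{g_s}\,\mathcal{N}(\mathcal{P},\vec{p})\,. \tag{2.13}$$

Because $\eta$ is our most fundamental version of the distribution function, we shall generally
use this relation in the rearranged form

$$\mathcal{N}(\mathcal{P},\vec{p}) = \frac{g_s}{h^3}\,\eta(\mathcal{P},\vec{p})\,. \tag{2.14}$$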

From quantum field theory we learn that the allowed values of the occupation number
for a quantum state depend on whether the state is that of a fermion (a particle with spin
1/2, 3/2, 5/2, ...) or that of a boson (a particle with spin 0, 1, 2, ...). For fermions no two
particles can occupy the same quantum state, so the occupation number can only take on
the eigenvalues 0 and 1. For bosons one can shove any number of particles one wishes into
the same quantum state, so the occupation number can take on the eigenvalues 0, 1, 2, 3,
.... Correspondingly, the mean occupation numbers must lie in the ranges

$$0 \le \eta \le 1 \ \text{ for fermions}\,, \qquad 0 \le \eta < \infty \ \text{ for bosons}\,. \tag{2.15}$$



Quantum field theory also teaches us that when $\eta \ll 1$, the particles, whether fermions or
bosons, behave like classical, discrete, distinguishable particles; and when $\eta \gg 1$ (possible
only for bosons), the particles behave like a classical wave [if the particles are photons
($s = 1$), like a classical electromagnetic wave; and if they are gravitons ($s = 2$), like a
classical gravitational wave].

EXERCISES 

Exercise 2.1 Example: Distribution function for Particles with a Range of Rest Masses 
Consider a collection of particles (e.g. stars) that are not all identical, but instead have 
different rest masses. 

(a) For a subset Q of particles like that of Fig. 2.2 and associated discussion, but with a
range of rest masses $dm$ centered on some value $m$, introduce the phase-space volume
$d^2V \equiv dV_x\,dV_p\,dm$ that the particles Q occupy. Explain why this occupied volume is
frame invariant.

(b) Show that this invariant occupied volume can be rewritten as $d^2V = (dV_x\,\mathcal{E}/m)(dV_p\,d\mathcal{E}) =
(dV_x\,\mathcal{E}/m)(dp^0\,dp^x\,dp^y\,dp^z)$. Explain the physical meaning of each of the terms in parentheses,
and show that each is frame-invariant.

If the number of particles in the set Q is $dN$, then we define the frame-invariant distribution
function by

$$\mathcal{N} \equiv \frac{dN}{d^2V} = \frac{dN}{dV_x\,dV_p\,dm}\,. \tag{2.16}$$

This is a function of location $\mathcal{P}$ in 4-dimensional spacetime and location $\vec{p}$ in 4-dimensional
momentum space (not confined to the mass hyperboloid), i.e. a function of location in
8-dimensional phase space. We will explore the evolution of this distribution function in Box
2.1 below.

Exercise 2.2 Example: Regimes of Particulate and Wave-like Behavior 

(a) A gamma-ray burster is an astrophysical object (probably a fireball of hot gas exploding
outward from the vicinity of a newborn black hole or colliding neutron stars or black
holes) at a cosmological distance from Earth ($\sim 10^{10}$ light years). The fireball emits
gamma rays with individual photon energies as measured at Earth $\mathcal{E} \sim 100$ keV. These
photons arrive at Earth in a burst whose total energy per unit area is roughly $10^{-6}$
ergs/cm$^2$ and that lasts about one second. Assume the diameter of the emitting surface
as seen from Earth is $\sim 1000$ km and there is no absorption along the route to Earth.
Make a rough estimate of the mean occupation number of the burst's photon states.
Your answer should be in the region $\eta \ll 1$, so the photons behave like classical,
distinguishable particles. Will the occupation number change as the photons propagate
from the source to Earth?

(b) A highly nonspherical supernova in the Virgo cluster of galaxies (40 million light years
from Earth) emits a burst of gravitational radiation with frequencies spread over the
band 500 Hz to 2000 Hz, as measured at Earth. The burst comes out in a time of about
10 milliseconds, so it lasts only a few cycles, and it carries a total energy of roughly
$10^{-3} M_\odot c^2$, where $M_\odot = 2 \times 10^{33}$ g is the mass of the Sun. The emitting region is about
the size of the newly forming neutron-star core (10 km), which is small compared to
the wavelength of the waves; so if one were to try to resolve the source spatially by
imaging the waves with a gravitational lens, one would see only a blur of spatial size
one wavelength rather than seeing the neutron star. What is the mean occupation
number of the burst's graviton states? Your answer should be in the region $\eta \gg 1$, so
the gravitons behave like a classical gravitational wave.



2.3 Thermal Equilibrium 

In Chap. 3 we will introduce with care, and explore in detail, the concept of "statistical 
equilibrium" — also called "thermal equilibrium". For now, we rely on the reader's prior 
experience with this concept. If a collection of many identical particles is in thermal equilibrium
in the neighborhood of an event $\mathcal{P}$ then, as we shall see in Chap. 3, there is a special
Lorentz frame (the mean rest frame of the particles near $\mathcal{P}$) in which the mean occupation
number takes on the following form:

$$\eta = \frac{1}{e^{(\mathcal{E}-\mu_R)/kT} + 1} \quad \text{for fermions}, \tag{2.17a}$$

$$\eta = \frac{1}{e^{(\mathcal{E}-\mu_R)/kT} - 1} \quad \text{for bosons}. \tag{2.17b}$$

Here $\mathcal{E} = p^0$ is the total energy of an individual particle as measured in that mean rest
frame. It is expressible in terms of the particle 4-momentum $\vec{p}$ and the rest-frame 4-velocity
$\vec{u}_{\rm rf}$ by

$$\mathcal{E} = -\vec{p}\cdot\vec{u}_{\rm rf} \tag{2.18a}$$

[Eq. (1.69)]; in terms of the spatial momentum $\mathbf{p}$ as measured in the mean rest frame, it is

$$\mathcal{E} = (m^2 + |\mathbf{p}|^2)^{1/2}\,. \tag{2.18b}$$

The quantity $k$ in Eqs. (2.17) is Boltzmann's constant, $k = 1.381 \times 10^{-16}$ erg K$^{-1}$ $= 1.381 \times
10^{-23}$ J K$^{-1}$; $T$ is the temperature of the collection of particles and $\mu_R$ is the particles'
relativistic chemical potential (the chemical potential including rest mass). When we remove
the rest mass we get the nonrelativistic chemical potential

$$\mu \equiv \mu_R - m\,. \tag{2.19}$$

The chemical potential $\mu_R$, the temperature $T$, and the rest-frame 4-velocity $\vec{u}_{\rm rf}$ can depend
on location in spacetime $\mathcal{P}$. Thus, the dependence of the equilibrium $\eta$ on $\mathcal{P}$ is through
$\mu_R(\mathcal{P})$, $T(\mathcal{P})$, and $\vec{u}_{\rm rf}(\mathcal{P})$; while its dependence on the particle 4-momentum is only through
$\mathcal{E} = -\vec{p}\cdot\vec{u}_{\rm rf}$.

Notice that the equilibrium mean occupation number (2.17a) for fermions lies in the
range 0 to 1 as required, while that (2.17b) for bosons lies in the range 0 to $\infty$. The
equilibrium fermion distribution (2.17a) is called the Fermi-Dirac distribution, while the
equilibrium boson distribution (2.17b) is called the Bose-Einstein distribution. In the regime
$(m - \mu_R)/kT \gg 1$ (i.e. $\mu \ll -kT$), the mean occupation number is small compared to unity
for all particle energies $\mathcal{E} > m$, and is the same for fermions and bosons:

$$\eta \simeq e^{-(\mathcal{E}-\mu_R)/kT} \quad \text{when } \mu \equiv \mu_R - m \ll -kT\,. \tag{2.20}$$

This is the regime of classical, distinguishable-particle behavior, and this limiting distribution
is called the Boltzmann distribution.
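
The three thermal distributions are easy to compare numerically. The sketch below is an illustration of our own rather than part of the formal development: the function names are hypothetical, and both arguments are measured in units of $kT$ (that is, they are the dimensionless combinations $\mathcal{E}/kT$ and $\mu_R/kT$). It evaluates Eqs. (2.17a), (2.17b), and (2.20) and shows the Fermi-Dirac and Bose-Einstein forms merging into the Boltzmann form when $(\mathcal{E} - \mu_R)/kT \gg 1$.

```python
import math

def eta_fermi_dirac(energy, mu_r):
    """Mean occupation number for fermions, Eq. (2.17a); arguments in units of kT."""
    return 1.0 / (math.exp(energy - mu_r) + 1.0)

def eta_bose_einstein(energy, mu_r):
    """Mean occupation number for bosons, Eq. (2.17b); requires energy > mu_r."""
    if energy <= mu_r:
        raise ValueError("Bose-Einstein distribution requires energy > mu_R")
    return 1.0 / math.expm1(energy - mu_r)

def eta_boltzmann(energy, mu_r):
    """Classical (Boltzmann) limit, Eq. (2.20)."""
    return math.exp(-(energy - mu_r))

if __name__ == "__main__":
    # Sweep (E - mu_R)/kT from the quantum regime to the classical regime.
    for x in (0.1, 1.0, 3.0, 10.0):
        print(x, eta_fermi_dirac(x, 0.0), eta_bose_einstein(x, 0.0), eta_boltzmann(x, 0.0))
```

For small $(\mathcal{E} - \mu_R)/kT$ the fermion value stays below 1 while the boson value exceeds 1; for large values all three columns agree.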

The temperature $T$ should already be familiar to the reader as a measure of the average
energy per particle in thermal equilibrium. The chemical potential $\mu_R$ might not be so
familiar. We shall learn its nature by exploring the consequences of the thermal distributions
(2.17a), (2.17b), (2.20). As we shall see, $\mu_R$ is a measure of how many particles are present.
It can be positive or negative (though for bosons it cannot exceed the rest mass $m$; otherwise
$\eta$ would be negative at low energies, $\mathcal{E} \sim m$);² and the more positive $\mu_R$ is, the larger is the
number of particles.

In the special case that the particles of interest can be created and destroyed completely
freely, with creation and destruction constrained only by the laws of 4-momentum conservation,
the particles quickly achieve a thermodynamic equilibrium in which the chemical
potential vanishes, $\mu_R = 0$ (as we shall see in Chap. 3). For example, photons inside a box
with perfectly emitting and absorbing walls that have temperature $T$ acquire the mean occupation
number (2.17b) with zero chemical potential; expressed in terms of photon frequency
$\nu$ as measured in the box's rest frame, the corresponding distribution function in its various
normalizations takes on the standard black-body (Planck) form

$$\eta = \frac{1}{e^{h\nu/kT} - 1}\,, \quad \mathcal{N} = \frac{2}{h^3}\,\frac{1}{e^{h\nu/kT} - 1}\,, \quad I_\nu = \frac{(2h/c^2)\,\nu^3}{e^{h\nu/kT} - 1}\,. \tag{2.21}$$

(Here, in the third expression, we have inserted the factor $c^{-2}$ so that $I_\nu$ will be in ordinary
units.) On the other hand, if one places a fixed number of photons inside a box whose walls
cannot emit or absorb them but can scatter them, exchanging energy with them in the process,
then the photons will acquire the Bose-Einstein distribution (2.17b) with temperature
$T$ equal to that of the walls and with nonzero chemical potential $\mu_R$ fixed by the number of
photons present; the more photons there are, the larger will be the chemical potential.
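
As a concrete illustration of the black-body form (2.21), the following sketch evaluates its three normalizations in cgs units. It is our own example, not part of the text's development: the function names are hypothetical, and the numerical constants are rounded standard values inserted by hand. For $h\nu \ll kT$ the occupation number is large, $\eta \simeq kT/h\nu$, and $I_\nu$ approaches the classical-wave limit $2\nu^2 kT/c^2$.

```python
import math

H = 6.626e-27      # Planck's constant, erg s
K = 1.381e-16      # Boltzmann's constant, erg/K
C = 2.998e10       # speed of light, cm/s

def eta_planck(nu, temperature):
    """Mean occupation number of photon states, first expression in Eq. (2.21)."""
    return 1.0 / math.expm1(H * nu / (K * temperature))

def number_density_in_phase_space(nu, temperature):
    """Distribution function N = (g_s/h^3) eta with g_s = 2 for photons."""
    return 2.0 / H**3 * eta_planck(nu, temperature)

def specific_intensity(nu, temperature):
    """Specific intensity I_nu (erg s^-1 cm^-2 Hz^-1 sr^-1), third expression in Eq. (2.21)."""
    return (2.0 * H / C**2) * nu**3 * eta_planck(nu, temperature)

if __name__ == "__main__":
    temperature = 300.0  # K, an arbitrary illustrative choice
    for nu in (1e9, 1e12, 1e13, 1e14):
        print(nu, eta_planck(nu, temperature), specific_intensity(nu, temperature))
```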

When the particles have finite rest mass $m$ and $kT$ is small compared to $m$, then almost
all of them will have speeds small compared to light. In this nonrelativistic regime, we shall
replace the particle energy $\mathcal{E} = m/\sqrt{1 - v^2} = \sqrt{m^2 + |\mathbf{p}|^2}$ by its nonrelativistic limit

$$E \equiv \mathcal{E} - m \simeq \frac{1}{2} m v^2 = \frac{|\mathbf{p}|^2}{2m}\,, \tag{2.22}$$

from which the rest mass has been removed, and we shall use the nonrelativistic chemical
potential $\mu \equiv \mu_R - m$. The thermal distributions then take on nonrelativistic forms that are
identical to their fully relativistic forms (2.17a), (2.17b), (2.20), but with $\mu_R$ replaced by $\mu$
and $\mathcal{E}$ replaced by $E$:³

$$\eta = \frac{1}{e^{(E-\mu)/kT} + 1} \quad \text{for fermions (Fermi-Dirac distribution)}, \tag{2.23a}$$

$$\eta = \frac{1}{e^{(E-\mu)/kT} - 1} \quad \text{for bosons (Bose-Einstein distribution)}, \tag{2.23b}$$

$$\eta = e^{-(E-\mu)/kT} \quad \text{for classical regime } \mu \ll -kT \text{ (Boltzmann distribution)}. \tag{2.23c}$$

2 For bosons with $\mu_R$ extremely close to $m$, i.e. $\mu \simeq 0$, a boson condensate can occur; we shall study boson
condensates in Sec. 3.4.4.

EXERCISES 



Exercise 2.3 Example: Velocity Distribution for Thermalized, Classical Particles 
Consider a collection of thermalized, classical particles with zero spin and nonzero rest mass, 
so they have the relativistic Boltzmann distribution (2.20). Denote by t,x,y,z the Lorentz 
coordinates of their mean rest frame. 

(a) Show that the chemical potential determines the total number density of particles $n$
in physical space, as measured in the mean rest frame; more specifically, show that
$n = dN/dx\,dy\,dz \propto e^{\mu_R/kT}$.

(b) Show that the distribution of particle energies as measured in the mean rest frame is

$$\frac{dN}{dx\,dy\,dz\,d\mathcal{E}} = \left(\frac{4\pi}{h^3}\,e^{\mu_R/kT}\right) \mathcal{E}\sqrt{\mathcal{E}^2 - m^2}\;e^{-\mathcal{E}/kT}\,. \tag{2.24}$$

(c) What is the distribution of particle speeds $v = |\mathbf{v}|$, $dN/dx\,dy\,dz\,dv$? Show that in the
limit $v \ll 1$ your answer reduces to the Maxwell velocity distribution

$$\frac{dN}{dx\,dy\,dz\,dv} = \left(\frac{4\pi m^3 e^{\mu/kT}}{h^3}\right) v^2 \exp\left(-\frac{m v^2}{2kT}\right)\,. \tag{2.25}$$

Draw a graph of this distribution.



3 Lynden-Bell (1967) identifies a fourth type of thermal distribution which occurs in the theory of violent
relaxation of star clusters. It corresponds to individually distinguishable, classical particles (in his case stars
with a range of masses) that obey the same kind of exclusion principle as fermions.






Exercise 2.4 Example: Observations of Cosmic Microwave Radiation from Earth 
The universe is filled with cosmic microwave radiation left over from the big bang. At each
event in spacetime the microwave radiation has a mean rest frame; and as seen in that mean
rest frame the radiation's distribution function $\eta$ is almost precisely isotropic and thermal
with zero chemical potential:

$$\eta = \frac{1}{e^{h\nu/kT_0} - 1}\,, \quad \text{with } T_0 = 2.73\ \mathrm{K}\,. \tag{2.26}$$

Here $\nu$ is the frequency of a photon as measured in the mean rest frame.

(a) Show that the specific intensity of the radiation as measured in its mean rest frame
has the Planck spectrum

$$I_\nu = \frac{(2h/c^2)\,\nu^3}{e^{h\nu/kT_0} - 1}\,. \tag{2.27}$$

Plot this specific intensity as a function of wavelength and from your plot determine
the wavelength of the intensity peak.

(b) Show that $\eta$ can be rewritten in the frame-independent form

$$\eta = \frac{1}{e^{-\vec{p}\cdot\vec{u}_0/kT_0} - 1}\,, \tag{2.28}$$

where $\vec{p}$ is the photon 4-momentum and $\vec{u}_0$ is the 4-velocity of the mean rest frame.

(c) In actuality the Earth moves relative to the mean rest frame of the microwave background
with a speed $v$ of about 600 km/sec toward the Hydra-Centaurus region of the
sky. An observer on Earth points his microwave receiver in a direction that makes an
angle $\theta$ with the direction of that motion, as measured in the Earth's frame. Show that
the specific intensity of the radiation received is precisely Planckian in form [Eq. (2.21)],
but with a direction-dependent Doppler-shifted temperature

$$T = T_0 \left(\frac{\sqrt{1 - v^2}}{1 - v\cos\theta}\right)\,. \tag{2.29}$$

Note that this Doppler shift of $T$ is precisely the same as the Doppler shift of the
frequency of any specific photon. Note also that the $\theta$ dependence corresponds to
an anisotropy of the microwave radiation as seen from Earth. Show that because the
Earth's velocity is small compared to the speed of light, the anisotropy is very nearly
dipolar in form. What is the magnitude $\Delta T/T$ of the variations between maximum and
minimum microwave temperature on the sky? It was by measuring these variations
that astronomers⁴ discovered the motion of the Earth relative to the mean rest frame
of the cosmic microwave radiation.
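
To get a feeling for the size of the effect in part (c), one can evaluate Eq. (2.29) numerically. The sketch below is our own illustration, with hypothetical function names; it assumes the values quoted in this exercise ($T_0 = 2.73$ K, $v \simeq 600$ km/s) and a rounded value for the speed of light. It compares the exact expression with the dipole approximation $T \simeq T_0 (1 + v\cos\theta)$, which follows from expanding (2.29) to first order in $v$.

```python
import math

C_KM_S = 2.998e5   # speed of light in km/s (rounded)
T0 = 2.73          # K, rest-frame temperature from Eq. (2.26)

def doppler_temperature(theta, speed_km_s, t0=T0):
    """Temperature seen at angle theta (radians) from the direction of motion, Eq. (2.29)."""
    v = speed_km_s / C_KM_S  # speed in units with c = 1
    return t0 * math.sqrt(1.0 - v * v) / (1.0 - v * math.cos(theta))

def dipole_temperature(theta, speed_km_s, t0=T0):
    """First-order (dipole) approximation to Eq. (2.29)."""
    v = speed_km_s / C_KM_S
    return t0 * (1.0 + v * math.cos(theta))

if __name__ == "__main__":
    speed = 600.0  # km/s
    t_max = doppler_temperature(0.0, speed)
    t_min = doppler_temperature(math.pi, speed)
    print("Delta T / T0 =", (t_max - t_min) / T0)
    for degrees in (0, 45, 90, 135, 180):
        theta = math.radians(degrees)
        print(degrees, doppler_temperature(theta, speed), dipole_temperature(theta, speed))
```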



4 Corey and Wilkinson (1976), and Smoot, Gorenstein, and Muller (1977). 

2.4 Macroscopic Properties of Matter as Integrals over Momentum Space

2.4.1 Number-Flux Vector and Stress-Energy Tensor 

The constraint that the laws of physics be frame-independent relationships between frame- 
independent objects is a powerful one. We shall see an example of that power in this section. 

Our objective is to study macroscopic properties of a distribution of particles. Guided by 
the Principle of Relativity (Chap. 1), we shall describe those properties as geometric objects 
(scalar, vector, and tensor fields) that live in spacetime, and we shall try to express those 
geometric objects as frame-independent integrals over momentum space. The integrals must 
satisfy a number of properties: 

i. Their integrands must involve the distribution function $\mathcal{N} = (g_s/h^3)\eta$, since $\mathcal{N}$ is the
only frame-independent object that we have to describe the state of the particles.

ii. The most interesting of the integrals (and the only ones we shall study) will be linear
in $\mathcal{N}$, so that if we double the number of particles present, we double the amount of
the quantity being computed.

iii. Besides $\mathcal{N}$, the only other frame-independent quantity we know that lives in momentum
space is the 4-momentum $\vec{p}$; and, thus, the integrand can involve $\mathcal{N}$, $\vec{p}$, and no other
quantities.

iv. The integration element must be frame-independent, so it must be $dV_p/\mathcal{E} = dp_x\,dp_y\,dp_z/\mathcal{E}$
when expressed in terms of the momentum components in any Lorentz frame.

The only integrals satisfying these four properties (aside from normalization) are

$$R \equiv \int \mathcal{N}\,\frac{dV_p}{\mathcal{E}}\,, \tag{2.30a}$$

$$S^\mu \equiv \int \mathcal{N}\,p^\mu\,\frac{dV_p}{\mathcal{E}}\,, \tag{2.30b}$$

$$T^{\mu\nu} \equiv \int \mathcal{N}\,p^\mu p^\nu\,\frac{dV_p}{\mathcal{E}}\,, \tag{2.30c}$$

and higher-order (cubic, quartic, ...) "moments" of $\mathcal{N}$. Here and throughout this chapter,
momentum space integrals unless otherwise specified are taken over the entire mass
hyperboloid. We shall explore each of the geometric, frame-independent entities (2.30) by
computing its components in a Lorentz frame.

In any Lorentz frame the scalar field $R$ of Eq. (2.30a) is

$$R = \int \mathcal{N}\,\frac{1}{\mathcal{E}}\,dp_x\,dp_y\,dp_z\,. \tag{2.31}$$

This is the sum, over all particles in a unit 3-volume, of the inverse energy. Although it is
intriguing that this quantity is frame-independent, it is not a quantity that appears in any
important way in the laws of physics.

By contrast, the 4-vector field $\vec{S}$ of Eq. (2.30b) plays a very important role in physics.
Its time component in a Lorentz frame is

$$S^0 = \int \mathcal{N}\,dp_x\,dp_y\,dp_z = \int \frac{dN}{dx\,dy\,dz\,dp_x\,dp_y\,dp_z}\,dp_x\,dp_y\,dp_z\,. \tag{2.32a}$$

Obviously this is the number of particles per unit three-dimensional volume $n$, i.e., the
particle number density, as measured in the Lorentz frame:

$$S^0 = n\,. \tag{2.32b}$$

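The $x$-component of $\vec{S}$, similarly, is

$$S^x = \int \mathcal{N}\,\frac{p^x}{\mathcal{E}}\,dp_x\,dp_y\,dp_z = \int \frac{dN}{dx\,dy\,dz\,dp_x\,dp_y\,dp_z}\,\frac{dx}{dt}\,dp_x\,dp_y\,dp_z = \int \frac{dN}{dt\,dy\,dz\,dp_x\,dp_y\,dp_z}\,dp_x\,dp_y\,dp_z\,. \tag{2.32c}$$

Here the second equality follows from

$$\frac{p^x}{\mathcal{E}} = \frac{p^x}{p^t} = \frac{dx/d\zeta}{dt/d\zeta} = \frac{dx}{dt} = (x\text{-component of velocity})\,. \tag{2.32d}$$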



It should be obvious from Eq. (2.32c) that $S^x$ is the flux of particles in the $x$-direction, i.e.,
the number of particles that cross a unit area orthogonal to the $x$-direction per unit time,
as measured in the Lorentz frame. Similarly, $S^y$ and $S^z$ are the fluxes of particles in the
$y$ and $z$ directions; and $\mathbf{S}$ (denoted $S_j$ in index notation) is the particle flux vector, a
3-dimensional vector residing in the three-dimensional space of the chosen Lorentz frame. The
full, 4-dimensional quantity $\vec{S}$ ($S^\mu$ in index notation) is the number-flux 4-vector introduced
in Sec. 1.11.2.

If the particles are conserved, then $\vec{S}$ satisfies the conservation law $S^\alpha{}_{;\alpha} = 0$; i.e., $\vec{\nabla}\cdot\vec{S} = 0$;
or in a Lorentz frame $\partial n/\partial t + \nabla\cdot\mathbf{S} = 0$ [Eq. (1.130)]. Notice that $\partial n/\partial t$ has the interpretation
of the increase in the number of particles in a unit volume per unit time, i.e. the increase in the
number of particles per unit 4-dimensional volume of spacetime; and the term $\nabla\cdot\mathbf{S}$ accounts
for this increase by the flow of particles across the spatial volume's walls. Correspondingly,
if particles are actually being created or destroyed, the frame-invariant evolution law for $\vec{S}$
must take the form

$$\vec{\nabla}\cdot\vec{S} = (\text{number of particles created minus number annihilated per unit spacetime volume})\,. \tag{2.33}$$

Turn to the quantity $T^{\mu\nu}$ defined by the integral (2.30c). By evaluating its components
in a chosen Lorentz frame, we can easily verify that this is the particles' stress-energy tensor,
which we studied in Sec. 1.12. Specifically, the component

$$T^{\mu 0} = \int \mathcal{N}\,p^\mu p^0\,\frac{dp_x\,dp_y\,dp_z}{p^0} = \int \frac{dN}{dx\,dy\,dz\,dp_x\,dp_y\,dp_z}\,p^\mu\,dp_x\,dp_y\,dp_z \tag{2.34a}$$

is obviously the $\mu$-component of 4-momentum per unit volume; i.e., $T^{00}$ is the energy per unit
volume and $T^{j0}$ is the $j$-component of momentum per unit volume. Similarly, the component

$$T^{\mu x} = \int \mathcal{N}\,p^\mu p^x\,\frac{dp_x\,dp_y\,dp_z}{p^0} = \int \frac{dN}{dx\,dy\,dz\,dp_x\,dp_y\,dp_z}\,\frac{dx}{dt}\,p^\mu\,dp_x\,dp_y\,dp_z = \int \frac{dN}{dt\,dy\,dz\,dp_x\,dp_y\,dp_z}\,p^\mu\,dp_x\,dp_y\,dp_z \tag{2.34b}$$

is the amount of $\mu$-component of 4-momentum per unit time that crosses a unit area orthogonal
to the $x$-direction; i.e., it is the $x$-component of flux of $\mu$-component of 4-momentum.
More specifically, $T^{0x}$ is the $x$-component of energy flux and $T^{jx}$ is the $x$ component of
spatial-momentum flux or, equivalently, the $jx$ component of the stress tensor. These and
the analogous expressions and interpretations of $T^{\mu y}$ and $T^{\mu z}$ can be summarized by the
familiar stress-energy-tensor relations

$$T^{00} = (\text{energy density})\,, \quad T^{j0} = (\text{momentum density}) = T^{0j} = (\text{energy flux})\,, \quad T^{jk} = (\text{stress tensor})\,. \tag{2.34c}$$

Notice that the energy density includes the contribution of rest mass, since $\mathcal{E} = p^0$ includes
rest mass. Later we shall find quite useful the following specialization of Eq. (2.30c) to the
stress tensor:

$$T_{jk} = \int \mathcal{N}\,p_j p_k\,\frac{dp_x\,dp_y\,dp_z}{\mathcal{E}} \simeq \int \mathcal{N}\,p_j p_k\,\frac{dp_x\,dp_y\,dp_z}{m} \quad \text{in the nonrelativistic (low-velocity) limit}. \tag{2.34d}$$

The mean rest frame of the particles, at some event $\mathcal{P}$, is that frame in which the
particle flux $S_j$ vanishes. We shall denote by $\vec{u}_{\rm rf}$ the 4-velocity of this mean rest frame.
Especially interesting are distribution functions $\mathcal{N}$ that are isotropic in the mean rest frame,
i.e., distribution functions that depend on the magnitude $|\mathbf{p}| \equiv p$ of the spatial momentum
of a particle, but not on its direction. Since the energy of a particle as measured in the mean
rest frame is also a function of $|\mathbf{p}| \equiv p$ and not of the momentum direction ($\mathcal{E} = \sqrt{m^2 + p^2}$),
we can equally well say that isotropic distributions are those for which $\mathcal{N}$ depends on the
4-momentum only through the energy as measured in the mean rest frame, i.e., only through

$$\mathcal{E} = -\vec{u}_{\rm rf}\cdot\vec{p} \quad \text{expressed in frame-independent form}, \qquad \mathcal{E} = p^0 = (m^2 + p^2)^{1/2} \quad \text{in mean rest frame}. \tag{2.35}$$

Notice that isotropy in the mean rest frame, i.e., $\mathcal{N} = \mathcal{N}(\mathcal{P},\mathcal{E})$, does not imply isotropy
in any other Lorentz frame. As seen in some other ("primed") frame, $\vec{u}_{\rm rf}$ will have a time
component $u^{0'}_{\rm rf} = \gamma$ and a space component $\mathbf{u}'_{\rm rf} = \gamma\mathbf{V}$ [where $\mathbf{V}$ is the mean rest frame's
velocity relative to the primed frame and $\gamma = (1 - \mathbf{V}^2)^{-1/2}$]; and correspondingly, $\mathcal{N}$ will be a
function of

$$\mathcal{E} = -\vec{u}_{\rm rf}\cdot\vec{p} = \gamma\left[(m^2 + |\mathbf{p}'|^2)^{1/2} - \mathbf{V}\cdot\mathbf{p}'\right]\,, \tag{2.36}$$

which is anisotropic: it depends on the direction of the spatial momentum $\mathbf{p}'$ relative to the
velocity $\mathbf{V}$ of the particles' mean rest frame. An example is the cosmic microwave radiation
as viewed from Earth, Ex. 2.4 above.
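
The frame independence of $\mathcal{E} = -\vec{u}_{\rm rf}\cdot\vec{p}$ in Eq. (2.36) can be verified with a few lines of arithmetic. The following sketch is our own illustration (the function names and sample numbers are arbitrary choices, and we use units with $c = 1$): it boosts a particle's 4-momentum from the mean rest frame into a primed frame in which the rest frame moves with velocity $V$ along the $x$ axis, and then evaluates the right-hand side of Eq. (2.36).

```python
import math

def boost_to_primed_frame(energy, p, V):
    """Transform (energy, p) from the mean rest frame to the primed frame.

    The mean rest frame moves with velocity V along +x relative to the
    primed frame; p is a tuple (px, py, pz); units have c = 1.
    """
    gamma = 1.0 / math.sqrt(1.0 - V * V)
    px, py, pz = p
    energy_primed = gamma * (energy + V * px)
    px_primed = gamma * (px + V * energy)
    return energy_primed, (px_primed, py, pz)

def rest_frame_energy_from_primed(mass, p_primed, V):
    """Right-hand side of Eq. (2.36) for V along the x axis."""
    gamma = 1.0 / math.sqrt(1.0 - V * V)
    p2 = sum(component * component for component in p_primed)
    return gamma * (math.sqrt(mass * mass + p2) - V * p_primed[0])

if __name__ == "__main__":
    mass, V = 1.0, 0.6
    for p in [(0.3, 0.0, 0.0), (-0.3, 0.0, 0.0), (0.0, 0.3, 0.0)]:
        energy = math.sqrt(mass**2 + sum(c * c for c in p))
        _, p_primed = boost_to_primed_frame(energy, p, V)
        p_primed_magnitude = math.sqrt(sum(c * c for c in p_primed))
        print(p, energy, rest_frame_energy_from_primed(mass, p_primed, V), p_primed_magnitude)
```

The three sample momenta have the same magnitude, and hence the same $\mathcal{E}$, in the mean rest frame, but different magnitudes $|\mathbf{p}'|$ in the primed frame. A distribution that depends only on $\mathcal{E}$ is therefore isotropic in the first frame and anisotropic in the second.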

2.4.2 Perfect Fluids and Equations of State

Consider a collection of particles whose distribution function is isotropic in their mean rest
frame. Such isotropy is readily produced by collisions between the particles. As measured
in the mean rest frame, the components of the number-flux 4-vector (2.30b) are

$$n \equiv S^0 = \int \mathcal{N}\,dp_x\,dp_y\,dp_z = 4\pi \int_0^\infty \mathcal{N}\,p^2\,dp \quad (\text{number density})\,, \tag{2.37a}$$

$$S_j = \int \mathcal{N}\,\frac{p_j}{\mathcal{E}}\,dp_x\,dp_y\,dp_z = 0\,, \qquad \mathcal{E} = \sqrt{m^2 + p^2}\,. \tag{2.37b}$$

Here in the third equality of Eq. (2.37a) we have switched from Cartesian to spherical coordinates
in momentum space. The integral (2.37b) vanishes because the integrand $\mathcal{N} p_j/\mathcal{E}$ is
odd in the integration variable $p_j$, and the integration, being over the entire mass hyperboloid,
runs from $p_j = -\infty$ to $p_j = +\infty$. Similarly, the time-time component of the stress-energy
tensor (2.30c), i.e., the energy density as measured in the rest frame, is

$$\rho \equiv T^{00} = \int \mathcal{N}\,\mathcal{E}\,dp_x\,dp_y\,dp_z = 4\pi \int_0^\infty \mathcal{N}\,\mathcal{E}\,p^2\,dp\,, \tag{2.38a}$$

and the energy flux (same as momentum density) is

$$T^{0j} = \int \mathcal{N}\,p_j\,dp_x\,dp_y\,dp_z = 0\,, \tag{2.38b}$$

which vanishes because the integrand is odd in $p_j$. Finally, because the distribution function
is isotropic in the momentum, the stress tensor $T_{jk}$ is isotropic, i.e., it is a multiple of the
metric $g_{jk} = \delta_{jk}$ of the rest frame's 3-dimensional space, so

$$T_{jk} = P\,\delta_{jk}\,. \tag{2.38c}$$

Here $P$ is a scalar that is defined only in this isotropic situation.

Notice that the stress-energy tensor (2.38a)-(2.38c) is that of a perfect fluid [Eq. (1.154)].
The mean rest frame of the particles, in which we are computing its components, is the rest
frame of the fluid, and $\rho$ and $P$ are the fluid's density and pressure as measured in this mean
rest frame.

The most computationally useful expression for the fluid's pressure $P$ is that obtained
by taking one-third the trace of the stress tensor [Eq. (2.34d)]:

$$P = \frac{1}{3}\left(T_{xx} + T_{yy} + T_{zz}\right) = \frac{1}{3}\int \mathcal{N}\,p^2\,\frac{dp_x\,dp_y\,dp_z}{\mathcal{E}} = \frac{4\pi}{3}\int_0^\infty \frac{\mathcal{N}\,p^4\,dp}{\sqrt{m^2 + p^2}}\,, \tag{2.38d}$$

where $p = |\mathbf{p}|$ is the magnitude of the momentum.

If we know the distribution function for an isotropic collection of relativistic particles,
Eqs. (2.37a), (2.38a), and (2.38d) give us a straightforward way of computing the collection's
number density of particles $n$, its perfect-fluid energy density $\rho$, and its perfect-fluid pressure
$P$ as measured in the mean rest frame. For the thermalized distribution functions (2.17a),
(2.17b), and (2.20) [with $\mathcal{N} = (g_s/h^3)\eta$], that calculation gives $n$, $\rho$ and $P$ in terms of the
particles' temperature $T$ and chemical potential $\mu_R$. One can then invert the expression for
$n(\mu_R, T)$ to give $\mu_R(n, T)$ and then insert into the expressions for $\rho$ and $P$ to obtain equations
of state for thermalized particles

$$\rho = \rho(n,T)\,, \quad P = P(n,T)\,. \tag{2.39}$$

The details of this calculation are carried out in Ex. 2.5 for a gas of thermalized photons
with vanishing chemical potential (i.e., with an isotropic, black-body distribution); the result
is

$$\rho = aT^4\,, \quad P = \frac{1}{3}\rho\,, \tag{2.40a}$$

where

$$a = \frac{8\pi^5 k^4}{15 h^3 c^3} = 7.56 \times 10^{-15}\ \mathrm{erg\,cm^{-3}\,K^{-4}} = 7.56 \times 10^{-16}\ \mathrm{J\,m^{-3}\,K^{-4}} \tag{2.40b}$$

is the "radiation constant."
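
The numerical value in Eq. (2.40b), and the route from the integral (2.38a) to $\rho = aT^4$, can both be checked directly. The following sketch is our own illustration with hypothetical function names; it uses SI constants inserted by hand and a simple midpoint-rule quadrature, which is one convenient choice among many.

```python
import math

# Physical constants in SI units (CODATA 2018 values, inserted by hand).
K_B = 1.380649e-23      # Boltzmann's constant, J/K
H = 6.62607015e-34      # Planck's constant, J s
C = 2.99792458e8        # speed of light, m/s

def radiation_constant():
    """Closed-form radiation constant a of Eq. (2.40b), in J m^-3 K^-4."""
    return 8.0 * math.pi**5 * K_B**4 / (15.0 * H**3 * C**3)

def photon_energy_density(temperature, steps=20000, x_max=60.0):
    """Energy density from Eq. (2.38a) with the Planck distribution, in J/m^3.

    With E = p c and x = p c / (k T), Eq. (2.38a) becomes
    rho = (8 pi / (h c)^3) (k T)^4 * integral of x^3 / (e^x - 1) dx,
    which is evaluated here with the midpoint rule.
    """
    dx = x_max / steps
    integral = 0.0
    for i in range(steps):
        x = (i + 0.5) * dx
        integral += x**3 / math.expm1(x) * dx
    return 8.0 * math.pi * (K_B * temperature)**4 / (H * C)**3 * integral

if __name__ == "__main__":
    a = radiation_constant()
    temperature = 2.73  # K
    print("a =", a, "J m^-3 K^-4")
    print("a T^4     =", a * temperature**4)
    print("numerical =", photon_energy_density(temperature))
```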

As we shall see in Sec. 27.4, when the Universe was younger than about 100,000 years, its 
energy density and pressure were predominantly due to thermalized photons (plus neutrinos 
which contributed roughly the same as the photons), so its equation of state was given by Eq. 
(2.40a) with the coefficient changed by a factor of order unity. Einstein's general relativistic 
field equations (Part VI) required that 

[Eq. (27.46)], where $G$ is Newton's gravitation constant and $\tau$ was the age of the universe as
measured in the mean rest frame of the photons. Putting in numbers, we find that

$$\rho = \frac{4.9 \times 10^{-12}\ \mathrm{g/cm^3}}{(\tau/1\,\mathrm{yr})^2}\,, \quad T \simeq \frac{10^6\ \mathrm{K}}{(\tau/1\,\mathrm{yr})^{1/2}}\,. \tag{2.41b}$$

This implies that when the universe was one minute old, its radiation density and temperature
were about 1 g/cm$^3$ and $6 \times 10^8$ K. These conditions were well suited for burning
hydrogen to helium; and, indeed, about 1/4 of all the mass of the universe did get burned
to helium at this early epoch. We shall examine this in further detail in Sec. 27.4.

Returning to our use of kinetic theory to compute equations of state: In the nonrelativistic
limit, the energy $\mathcal{E}$ of a particle becomes $m + E$ where $E = p^2/2m$, the relativistic energy
density $\rho$ becomes $\rho = mn + \epsilon$, where $\epsilon$ is the energy density excluding rest mass, and Eqs.
(2.37a), (2.38a), and (2.38d) reduce to the following nonrelativistic expressions:

$$n = \int_0^\infty \mathcal{N}\,4\pi p^2\,dp\,, \quad \epsilon = \frac{3}{2}P = \int_0^\infty \mathcal{N}\,\frac{p^2}{2m}\,4\pi p^2\,dp\,. \tag{2.42}$$

In Ex. 2.6, we use these expressions to derive the equations of state for a thermalized,
nonrelativistic ($kT \ll mc^2$), classical ($\eta \ll 1$, i.e. $\mu/kT \ll -1$) gas; the result is

$$n = \frac{g_s\,e^{\mu/kT}}{\lambda_{TdB}^3} = \frac{g_s}{h^3}\,(2\pi m kT)^{3/2}\,e^{\mu/kT}\,, \tag{2.43a}$$

$$\epsilon = \frac{3}{2}nkT\,, \quad P = nkT\,, \tag{2.43b}$$

so the mean energy per particle is

$$\bar{E} = \frac{3}{2}kT\,. \tag{2.43c}$$

Here $\lambda_{TdB} \equiv h/\sqrt{2\pi m kT}$ is the particles' thermal deBroglie wavelength, i.e. the wavelength
of Schroedinger wave-function oscillations for a particle with the thermal kinetic
energy $E = \pi kT$. Note that the classical regime $\eta \ll 1$ (i.e. $\mu/kT \ll -1$) corresponds
to a mean number of particles, in a cube one thermal deBroglie wavelength on a side, that is
small compared to one, $n\lambda_{TdB}^3 \ll 1$, which should not be surprising.
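
The passage from the integrals (2.42) to the closed forms (2.43) is carried out analytically in Ex. 2.6, but it can also be confirmed by brute-force quadrature. The sketch below is our own illustration: the function names, the arbitrary consistent units (with $h = 1$ by default), and the midpoint-rule integration are all assumptions made for the example. It uses the Boltzmann distribution (2.23c) with $\mathcal{N} = (g_s/h^3)\eta$.

```python
import math

def classical_gas_moments(mass, kT, mu, g_s=1, h=1.0, steps=20000):
    """Evaluate n, epsilon, and P by numerical integration over momentum.

    Uses N = (g_s/h^3) exp(-(E - mu)/kT) with E = p^2 / (2 m), integrated
    by the midpoint rule out to a momentum where the integrand is negligible.
    """
    p_max = 12.0 * math.sqrt(mass * kT)
    dp = p_max / steps
    n = epsilon = pressure = 0.0
    for i in range(steps):
        p = (i + 0.5) * dp
        energy = p * p / (2.0 * mass)
        dist = g_s / h**3 * math.exp(-(energy - mu) / kT)
        shell = 4.0 * math.pi * p * p * dp               # momentum-space shell volume
        n += dist * shell                                 # Eq. (2.42), first integral
        epsilon += dist * energy * shell                  # Eq. (2.42), second integral
        pressure += dist * (p * p / mass) * shell / 3.0   # nonrelativistic limit of Eq. (2.38d)
    return n, epsilon, pressure

def classical_gas_closed_form(mass, kT, mu, g_s=1, h=1.0):
    """Closed-form results, Eqs. (2.43a) and (2.43b)."""
    n = g_s / h**3 * (2.0 * math.pi * mass * kT) ** 1.5 * math.exp(mu / kT)
    return n, 1.5 * n * kT, n * kT

if __name__ == "__main__":
    args = dict(mass=1.0, kT=0.01, mu=-0.05, g_s=2)
    print(classical_gas_moments(**args))
    print(classical_gas_closed_form(**args))
```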

Fig. 2.5: Physical nature of matter at various densities and temperatures.

As an application, consider ordinary matter. Fig. 2.5 shows its physical nature as a 
function of density and temperature, near and above "room temperature", 300 K. We shall 
study solids (lower right) in Part III, fluids (lower middle) in Part IV, and plasmas (middle) in 
Part V. Our kinetic theory tools are well suited to any situation where the particles have mean
free paths large compared to their sizes; this is generally true in plasmas, and sometimes in 
fluids (e.g. air and other gases but not water) and even sometimes in solids (e.g. for electrons 
in a metal). Here we shall focus on a nonrelativistic plasma, i.e. the region of Fig. 2.5 that is 
bounded by the two dashed lines and the slanted solid line. For concreteness and simplicity, 
we shall regard the plasma as made solely of hydrogen. (This is a good approximation in 
most astrophysical situations; the modest amounts of helium and traces of other elements 
usually do not play a major role in equations of state. By contrast, for a laboratory plasma 
it can be a poor approximation; for quantitative analyses one must pay attention to the 
plasma's chemical composition.) 

Our nonrelativistic hydrogen plasma consists of a mixture of two gases (or "fluids"):
free electrons and free protons, in equal numbers. Each fluid has a particle number density
$n = \rho/m_p$, where $\rho$ is the total mass density and $m_p$ is the proton mass. (The electrons are so
light that they do not contribute significantly to $\rho$, and because the plasma is nonrelativistic,
the thermal contributions to $\rho$ are negligible.) Correspondingly, the equation of state includes
equal contributions from the electrons and protons and is given by [cf. Eq. (2.43b)]

$$\epsilon = 3(k/m_p)\rho T\,, \quad P = 2(k/m_p)\rho T\,. \tag{2.44}$$

The upper dashed boundary (Fig. 2.5) on the region of validity for this equation of state
is the temperature $T_{\rm rel} = m_e c^2/k = 6 \times 10^9$ K at which the electrons become strongly
relativistic. In Ex. 4.5 we shall compute the thermal production of electron-positron pairs
in the hot plasma and thereby shall discover that the upper boundary of validity is actually
somewhat lower than this (Fig. 4.6). The lower dashed boundary is the temperature $T_{\rm rec} \sim$
(ionization energy of hydrogen)/(a few $k$) $\sim 10^4$ K at which electrons and protons begin
to recombine and form neutral hydrogen. In Ex. 4.6 we shall analyze the conditions for
ionization-recombination equilibrium and thereby shall refine this boundary. The solid right
boundary is the point at which the electrons cease to behave like classical particles, because
their mean occupation number $\eta$ ceases to be small compared to unity. As one can see from
the Fermi-Dirac distribution (2.23a), for typical electrons (which have energies $E \sim kT$), the
regime of classical behavior ($\eta \ll 1$; left side of solid line) is $\mu \ll -kT$ and the regime of
strong quantum behavior ($\eta \simeq 1$; electron degeneracy; right side of solid line) is $\mu \gg +kT$.
The slanted solid boundary in Fig. 2.5 is thus the location $\mu = 0$, which translates via Eq.
(2.43a) to

$$\rho = \rho_{\rm deg} \equiv (2m_p/h^3)(2\pi m_e kT)^{3/2} = 0.01\,(T/10^4\,\mathrm{K})^{3/2}\ \mathrm{g/cm^3}\,. \tag{2.45}$$
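
The numbers quoted for the boundaries of the classical-plasma region are easy to reproduce. The sketch below is our own illustration, with hypothetical function names and rounded cgs constants inserted by hand; it evaluates $T_{\rm rel} = m_e c^2/k$ and the degeneracy boundary of Eq. (2.45).

```python
import math

# cgs constants (rounded values inserted by hand)
H = 6.626e-27       # Planck's constant, erg s
K = 1.381e-16       # Boltzmann's constant, erg/K
M_E = 9.109e-28     # electron mass, g
M_P = 1.673e-24     # proton mass, g
C = 2.998e10        # speed of light, cm/s

def rho_deg(temperature):
    """Density (g/cm^3) at which mu = 0 for the electrons, Eq. (2.45)."""
    return (2.0 * M_P / H**3) * (2.0 * math.pi * M_E * K * temperature) ** 1.5

def t_rel():
    """Temperature (K) at which kT equals the electron rest energy."""
    return M_E * C**2 / K

if __name__ == "__main__":
    print("T_rel =", t_rel(), "K")
    for temperature in (1e4, 1e6, 1e8):
        print(temperature, rho_deg(temperature), 0.01 * (temperature / 1e4) ** 1.5)
```

The two columns printed for each temperature compare the exact prefactor with the rounded coefficient 0.01 of Eq. (2.45).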

Although the hydrogen gas is degenerate to the right of this boundary, we can still
compute its equation of state using our kinetic-theory equations (2.38a) and (2.38d), so
long as we use the quantum mechanically correct distribution function for the electrons:
the Fermi-Dirac distribution (2.23a). In this electron-degenerate region, the proton pressure
retains its classical value, $P \propto \rho$, while the electron pressure grows much more rapidly
with density. Heuristically, this is because the electrons are being confined by the Pauli
exclusion principle to regions of ever shrinking size, causing their zero-point motions and
associated pressure to grow. As a result, the proton pressure becomes negligible and the
electron pressure dominates. Moreover, when the density gets sufficiently high, the zero-point
motions become relativistically fast (the electron chemical potential $\mu_e = \mu_{Re} - m_e$
becomes of order $m_e$), so the nonrelativistic, Newtonian analysis fails and the matter enters
a domain of "relativistic degeneracy". Both domains, nonrelativistic degeneracy ($\mu_e \ll m_e$)
and relativistic degeneracy ($\mu_e \gtrsim m_e$), occur for matter inside a massive white-dwarf star,
the type of star that the Sun will become when it dies. We shall study the structures of such
stars in the Fluid-Mechanics part of this book (Chap. 12); and in Chap. 25 we shall see how
general relativity (spacetime curvature) modifies that structure and helps force sufficiently
massive white dwarfs to collapse.

The equation of state for electron-degenerate hydrogen will be a key foundation for our 
study of these stars. That equation of state, due to Wilhelm Anderson and Edmund Stoner 
in 1930 [see the history on pp. 153-154 of Thorne (1994)] and derived in Ex. 2.7, is the 
following: 

The electrons' chemical potential including rest mass, $\mu_{Re}$, is called their Fermi energy
(including rest mass) and denoted $\mathcal{E}_F \equiv \mu_{Re}$; the corresponding momentum
$p_F = \sqrt{\mathcal{E}_F^2 - m_e^2}$ is called their Fermi momentum; and we introduce a dimensionless
parameter $t$ related to this Fermi energy and momentum by

$$\mathcal{E}_F \equiv \mu_{Re} \equiv m_e \cosh(t/4) , \qquad p_F \equiv \sqrt{\mathcal{E}_F^2 - m_e^2} \equiv m_e \sinh(t/4) . \tag{2.46a}$$

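Then the number density of electrons $n_e$, their total density of mass-energy $\rho_e$, and their
pressure $P_e$ are (Ex. 2.7)

$$\begin{aligned}
n_e &= \frac{8\pi}{3(h/m_e)^3}\left(\frac{p_F}{m_e}\right)^3 = \frac{8\pi}{3(h/m_e)^3}\,\sinh^3(t/4) , \\
\rho_e &= \frac{8\pi m_e}{(h/m_e)^3}\int_0^{p_F/m_e} x^2\sqrt{1+x^2}\,dx = \frac{\pi m_e}{4(h/m_e)^3}\,[\sinh(t) - t] , \\
P_e &= \frac{8\pi m_e}{3(h/m_e)^3}\int_0^{p_F/m_e} \frac{x^4}{\sqrt{1+x^2}}\,dx = \frac{\pi m_e}{12(h/m_e)^3}\,[\sinh(t) - 8\sinh(t/2) + 3t] .
\end{aligned} \tag{2.46b}$$
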

In a white dwarf star the protons, with their high rest mass, are nondegenerate, the total
density of mass-energy is dominated by the proton rest mass density, and since there is one
proton for each electron in the hydrogen gas, that total is

$$\rho \simeq m_p n_e = \frac{8\pi m_p}{3(h/m_e)^3}\,\sinh^3(t/4) . \tag{2.46c}$$

The dividing line between nonrelativistic and relativistic degeneracy is the point where
the electron Fermi momentum is equal to the electron rest mass, $p_F = m_e$, i.e.
$\sinh(t/4) = 1$, which corresponds to a total mass density

$$\rho_{\rm rel\,deg} = \frac{8\pi m_p}{3(h/m_e)^3} \simeq 1.0 \times 10^6\ {\rm g/cm^3} . \tag{2.46d}$$

The equation of state (2.46b) implies

$$\begin{aligned}
P_e &\propto \rho^{5/3} \quad \text{in the nonrelativistic regime, } \rho \ll \rho_{\rm rel\,deg} , \\
P_e &\propto \rho^{4/3} \quad \text{in the relativistic regime, } \rho \gg \rho_{\rm rel\,deg} .
\end{aligned} \tag{2.46e}$$

These asymptotic equations of state turn out to play a crucial role in the structure and
stability of white-dwarf stars [Chaps. 12 and 25; Chap. 4 of Thorne (1994)].
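
The two asymptotic slopes and the dividing density can be read off numerically from the parametric form (2.46b), (2.46c). The sketch below assumes cgs constants with factors of $c$ restored (the text sets $c = 1$), and the function names `degenerate_eos` and `log_slope` are hypothetical helpers introduced only for this illustration.

```python
import math

# Physical constants in cgs units (rounded CODATA values; assumed here).
H = 6.62607015e-27      # Planck constant [erg s]
C = 2.99792458e10       # speed of light [cm/s]
M_E = 9.1093837e-28     # electron mass [g]
M_P = 1.67262192e-24    # proton mass [g]
LAMBDA_C = H / (M_E * C)  # electron Compton wavelength, h/m_e in c = 1 units [cm]


def degenerate_eos(x_fermi):
    """Return (rho [g/cm^3], P_e [erg/cm^3]) for x_fermi = p_F / (m_e c)."""
    t = 4.0 * math.asinh(x_fermi)          # Eq. (2.46a): sinh(t/4) = p_F/m_e
    rho = 8.0 * math.pi * M_P / (3.0 * LAMBDA_C**3) * x_fermi**3   # Eq. (2.46c)
    bracket = math.sinh(t) - 8.0 * math.sinh(t / 2.0) + 3.0 * t
    pressure = math.pi * M_E * C**2 / (12.0 * LAMBDA_C**3) * bracket  # Eq. (2.46b)
    return rho, pressure


def log_slope(x_fermi, eps=1e-3):
    """Finite-difference estimate of d ln P_e / d ln rho at the given x_fermi."""
    rho_1, p_1 = degenerate_eos(x_fermi)
    rho_2, p_2 = degenerate_eos(x_fermi * (1.0 + eps))
    return math.log(p_2 / p_1) / math.log(rho_2 / rho_1)


for x in (0.05, 1.0, 100.0):
    rho, pressure = degenerate_eos(x)
    print(f"p_F/m_e = {x:6.2f}: rho = {rho:.3e} g/cm^3, slope = {log_slope(x):.4f}")
```

Under these assumptions the slope tends to 5/3 for $p_F \ll m_e$ and to 4/3 for $p_F \gg m_e$, and $p_F = m_e$ corresponds to a density close to $10^6\ {\rm g/cm^3}$, in line with Eqs. (2.46d) and (2.46e).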

EXERCISES 

Exercise 2.5 Example and Derivation: Equation of State for a Photon Gas 

(a) Consider a collection of photons with a distribution function $\mathcal{N}$ which, in the mean
rest frame of the photons, is isotropic. Show, using Eqs. (2.38a) and (2.38d), that this
photon gas obeys the equation of state $P = \frac{1}{3}\rho$.

Table 2.1: Zeta function and Bernoulli Numbers

(b) Suppose the photons are thermalized with zero chemical potential, i.e., they are isotropic
with a black-body spectrum. Show that $\rho = aT^4$, where $a$ is the radiation constant of
Eq. (2.40b). Note: Equations in the note at the end of this problem may be helpful.

(c) Show that for this isotropic, black-body photon gas the number density of photons is

$$n = bT^3 , \quad \text{where } b = 16\pi\,\zeta(3)\left(\frac{k}{h}\right)^3 , \tag{2.47a}$$

and that the mean energy of a photon in the gas is

$$\bar{\mathcal{E}}_\gamma = \frac{\pi^4}{30\,\zeta(3)}\,kT = 2.7011780\ldots\,kT . \tag{2.47b}$$

Here $\zeta(q)$ is the Riemann Zeta function.
Note: The Riemann Zeta function $\zeta(q)$, for $q > 1$, is given by

$$\zeta(q) = \sum_{n=1}^{\infty} \frac{1}{n^q} , \tag{2.48a}$$

and it appears in the following integral, which arises in this problem:

$$\int_0^\infty \frac{x^{q-1}}{e^x - 1}\,dx = \Gamma(q)\,\zeta(q) . \tag{2.48b}$$

Here $\Gamma(q)$ is the gamma function, which is equal to $(q-1)!$ if $q$ is an integer. For the special
case where the exponent in Eq. (2.48b) is an even integer, written here as $2q$ with $q \ge 1$ an
integer (so $2q - 1$ is odd and $\ge 1$), the integral takes on the value

$$\int_0^\infty \frac{x^{2q-1}}{e^x - 1}\,dx = (-1)^{q-1}\,(2\pi)^{2q}\,\frac{B_{2q}}{4q} , \tag{2.48c}$$

where $B_n$ is the Bernoulli number. Some values of the Riemann Zeta function and Bernoulli
number are given in Table 2.1.
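
The relations in this note lend themselves to a quick numerical check. The following is a sketch in plain Python, assuming a simple Euler-Maclaurin tail for the zeta sum and composite Simpson quadrature truncated at $x = 60$; the function names `zeta` and `bose_integral` are hypothetical, and the three Bernoulli numbers are standard values typed in by hand.

```python
import math


def zeta(q, n_terms=1000):
    """Riemann zeta for q > 1: direct sum plus an Euler-Maclaurin tail estimate."""
    partial = sum(n ** -q for n in range(1, n_terms))
    big_n = n_terms
    tail = (big_n ** (1 - q) / (q - 1)          # integral of x^-q from N to infinity
            + 0.5 * big_n ** -q                 # half of the first omitted term
            + q * big_n ** (-q - 1) / 12.0)     # first derivative correction
    return partial + tail


def bose_integral(q, x_max=60.0, n_steps=6000):
    """Simpson estimate of the integral of x^(q-1)/(e^x - 1) over [0, inf), q >= 2."""
    def integrand(x):
        if x == 0.0:
            return 1.0 if q == 2 else 0.0       # limit of x^(q-2) as x -> 0
        return x ** (q - 1) / math.expm1(x)

    h = x_max / n_steps
    total = integrand(0.0) + integrand(x_max)
    for i in range(1, n_steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3.0


# Eq. (2.48b) for q = 3 and q = 4.
print(bose_integral(3), math.gamma(3) * zeta(3))
print(bose_integral(4), math.gamma(4) * zeta(4))

# Eq. (2.48c) with B_2 = 1/6, B_4 = -1/30, B_6 = 1/42.
BERNOULLI = {2: 1.0 / 6.0, 4: -1.0 / 30.0, 6: 1.0 / 42.0}
for q in (1, 2, 3):
    closed_form = (-1) ** (q - 1) * (2.0 * math.pi) ** (2 * q) * BERNOULLI[2 * q] / (4 * q)
    print(q, bose_integral(2 * q), closed_form)

# Mean photon energy in units of kT, Eq. (2.47b).
mean_photon_energy_over_kT = bose_integral(4) / bose_integral(3)
print(mean_photon_energy_over_kT, math.pi**4 / (30.0 * zeta(3)))
```

The ratio of the $q = 4$ and $q = 3$ integrals is the mean photon energy in units of $kT$, since the energy density involves $x^3/(e^x - 1)$ and the number density involves $x^2/(e^x - 1)$.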



Exercise 2.6 Example and Derivation: Equation of State for a Nonrelativistic, Classical 
Gas 

Consider a collection of identical, classical (i.e., with $\eta \ll 1$) particles with a distribution
function $\mathcal{N}$ which is thermalized at a temperature $T$ such that $kT \ll mc^2$ (nonrelativistic
temperature).

(a) Show that the distribution function, expressed in terms of the particles' momentum or
velocity in their mean rest frame, is

$$\mathcal{N} = \frac{g_s}{h^3}\,e^{\mu/kT}\,e^{-p^2/2mkT} , \quad \text{where } p = |\mathbf{p}| = mv , \tag{2.49}$$

with $v$ being the speed of a particle.

(b) Show that the number density of particles in the mean rest frame is given by Eq. 
(2.43a). 

(c) Show that this gas satisfies the equations of state (2.43b). 

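Note: The following integrals, for nonnegative integral values of $q$, will be useful:

$$\int_0^\infty x^{2q}\,e^{-x^2}\,dx = \frac{(2q-1)!!}{2^{q+1}}\,\sqrt{\pi} , \tag{2.50}$$

where $n!! \equiv n(n-2)(n-4)\ldots(2 \text{ or } 1)$; and

$$\int_0^\infty x^{2q+1}\,e^{-x^2}\,dx = \frac{1}{2}\,q! . \tag{2.51}$$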
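
As a sanity check on the two Gaussian integrals in this note, the sketch below compares the closed forms with a direct numerical quadrature. It assumes the convention $(-1)!! = 1$ (needed for $q = 0$), truncates the integration at $x = 12$, and uses hypothetical helper names throughout.

```python
import math


def double_factorial(n):
    """n!! = n (n-2) (n-4) ... (2 or 1); taken to be 1 for n <= 0 (assumed convention)."""
    result = 1
    while n > 1:
        result *= n
        n -= 2
    return result


def gaussian_moment(power, x_max=12.0, n_steps=12000):
    """Composite Simpson estimate of the integral of x^power exp(-x^2) over [0, inf)."""
    def integrand(x):
        return x ** power * math.exp(-x * x)

    h = x_max / n_steps
    total = integrand(0.0) + integrand(x_max)
    for i in range(1, n_steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3.0


def even_moment_formula(q):
    """Right-hand side of Eq. (2.50)."""
    return double_factorial(2 * q - 1) / 2 ** (q + 1) * math.sqrt(math.pi)


def odd_moment_formula(q):
    """Right-hand side of Eq. (2.51)."""
    return 0.5 * math.factorial(q)


for q in range(4):
    print(q, gaussian_moment(2 * q), even_moment_formula(q),
          gaussian_moment(2 * q + 1), odd_moment_formula(q))
```

For example, the $q = 1$ case of Eq. (2.50), $\sqrt{\pi}/4$, is the integral that turns the momentum-space integral of Eq. (2.49) into the factor $(2\pi mkT)^{3/2}$ in the number density.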



Exercise 2.7 Problem and Derivation: Equation of State for Electron-Degenerate Hydrogen 
Derive the equation of state (2.46b) for an electron-degenerate hydrogen gas. In your
derivation, approximate the temperature $T$ as zero, explain why this approximation is justified,
and deduce that the electrons' mean occupation number is 1 for $\mathcal{E} < \mathcal{E}_F$ and 0 for $\mathcal{E} > \mathcal{E}_F$.
(Note: It might be easiest to compute the integrals with the help of symbolic manipulation 
software such as Mathematica or Maple.) 

Exercise 2.8 Example: Specific Heat for Nonrelativistic, Degenerate Electrons in White 
Dwarfs and in Metals 

Consider a nonrelativistically degenerate electron gas at finite but small temperature. 

(a) Show that the inequalities $kT \ll \mu_e \ll m_e$ are equivalent to the words
"nonrelativistically degenerate". Here $\mu_e \equiv \mu_{Re} - m_e$ is the chemical potential (Fermi
energy) excluding rest mass.

(b) Show that the electrons' mean occupation number $\eta(E)$ has the form depicted in Fig.
2.6: It is near unity out to (nonrelativistic) energy $E \simeq \mu_e - 2kT$, and it then drops
to nearly zero over a range of energies $\Delta E \sim 4kT$.

(c) If the electrons were nonrelativistic but nondegenerate, their thermal energy density
would be $\epsilon = \frac{3}{2}nkT$, so the total electron energy (excluding rest mass) in a volume $V$
containing $N = nV$ electrons would be $E = \frac{3}{2}NkT$, and the electron specific heat, at
fixed volume, would be

$$C_V \equiv \left(\frac{\partial E}{\partial T}\right)_V = \frac{3}{2}\,Nk \quad \text{(nondegenerate, nonrelativistic)}. \tag{2.52}$$

Using the semiquantitative form of $\eta$ depicted in Fig. 2.6, show that to within a factor of
order unity the specific heat of degenerate electrons is smaller than in the nondegenerate
case by a factor $\sim kT/\mu_e$:

$$C_V \equiv \left(\frac{\partial E}{\partial T}\right)_V \sim \frac{kT}{\mu_e}\,Nk \quad \text{(degenerate, nonrelativistic)}. \tag{2.53}$$

Fig. 2.6: The Fermi-Dirac distribution function for electrons in the nonrelativistic, degenerate
regime $kT \ll \mu_e \ll m_e$, with temperature such that $kT/\mu_e = 0.03$. Note that $\eta$ drops from near
one to near zero over the range $\mu_e - 2kT \lesssim E \lesssim \mu_e + 2kT$.



(d) Compute the multiplicative factor in this equation for $C_V$. More specifically, show
that, to first order in $kT/\mu_e$,

$$C_V = \frac{\pi^2}{2}\,\frac{kT}{\mu_e}\,Nk . \tag{2.54}$$

(e) As a specific application, consider hydrogen inside a white dwarf star with density
$\rho = 10^5\ {\rm g\,cm^{-3}}$ and temperature $T = 10^6$ K. (These are typical values for a white dwarf
interior.) What are the numerical values of $\mu_e/m_e$ and $kT/\mu_e$ for the electrons? What
is the numerical value of the dimensionless factor $(\pi^2/2)(kT/\mu_e)$ by which degeneracy
reduces the electron specific heat?

(f) As a second application, consider the electrons inside a copper wire in a laboratory on
Earth at room temperature. Each copper atom donates about one electron to a "gas"
of freely traveling (conducting) electrons, and keeps the rest of its electrons bound to
itself. What are the numerical values of $\mu_e/m_e$ and $kT/\mu_e$ for the conducting electron
"gas"? Verify that these are in the range corresponding to nonrelativistic degeneracy.
What is the value of the factor $(\pi^2/2)(kT/\mu_e)$ by which degeneracy reduces the electron
specific heat? At room temperature, this electron contribution to the specific heat is
far smaller than the contribution from thermal vibrations of the copper atoms (i.e.,
thermal sound waves, i.e. thermal phonons), but at very low temperatures the electron
contribution dominates, as we shall see in the next exercise.
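
The shape of the occupation number discussed in part (b) of this exercise can be tabulated in a few lines. This is a minimal sketch, assuming the standard Fermi-Dirac form $\eta = 1/(e^{(E-\mu_e)/kT} + 1)$ and the value $kT/\mu_e = 0.03$ quoted in the caption of Fig. 2.6; the helper names are hypothetical.

```python
import math


def fermi_dirac_occupation(energy_over_kT, mu_over_kT):
    """Mean occupation number eta = 1 / (exp((E - mu)/kT) + 1)."""
    return 1.0 / (math.exp(energy_over_kT - mu_over_kT) + 1.0)


def specific_heat_factor(kT_over_mu):
    """The dimensionless factor (pi^2 / 2) (kT / mu_e) quoted in parts (e) and (f)."""
    return 0.5 * math.pi**2 * kT_over_mu


mu_over_kT = 1.0 / 0.03          # value used for Fig. 2.6
for offset in (-4, -2, 0, 2, 4):  # (E - mu_e) / kT
    eta = fermi_dirac_occupation(mu_over_kT + offset, mu_over_kT)
    print(f"(E - mu_e)/kT = {offset:+d}:  eta = {eta:.4f}")

print("factor at kT/mu_e = 0.03:", specific_heat_factor(0.03))
```

The occupation number is about 0.88 at $E = \mu_e - 2kT$ and about 0.12 at $E = \mu_e + 2kT$, which is the sense in which it drops from near one to near zero over a range $\Delta E \sim 4kT$.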

Exercise 2.9 Example: Specific Heat for Phonons in an Isotropic Solid 

In Sec. 11.3 we will study classical sound waves propagating through an isotropic, elastic
solid. As we shall see, there are two types of sound waves: longitudinal with
frequency-independent speed $c_L$, and transverse with a somewhat smaller frequency-independent speed
$c_T$. For each type of wave, $s = L$ or $T$, the material of the solid undergoes an elastic
displacement $\boldsymbol{\xi} = A\,\mathbf{f}_s \exp[i(\mathbf{k}\cdot\mathbf{x} - \omega t)]$, where $A$ is the wave amplitude, $\mathbf{f}_s$ is a unit vector
(polarization vector) pointing in the direction of the displacement, $\mathbf{k}$ is the wave vector,
and $\omega$ is the wave frequency. The wave speed is $c_s = \omega/|\mathbf{k}|$. Associated with these waves
are quanta called phonons. As for any wave, each phonon has a momentum related to its
wave vector by $\mathbf{p} = \hbar\mathbf{k}$, and an energy related to its frequency by $\mathcal{E} = \hbar\omega$. Combining
these relations we learn that the relationship between a phonon's energy and the magnitude
$p = |\mathbf{p}|$ of its momentum is $\mathcal{E} = c_s p$. This is the same relationship as for photons, but
with the speed of light replaced by the speed of sound! For longitudinal waves, $\mathbf{f}_L$ is in the
propagation direction $\mathbf{k}$ so there is just one polarization, $g_L = 1$; for transverse waves, $\mathbf{f}_T$ is
orthogonal to $\mathbf{k}$, so there are two orthogonal polarizations (e.g. $\mathbf{f}_T = \mathbf{e}_x$ and $\mathbf{f}_T = \mathbf{e}_y$ when $\mathbf{k}$
points in the $\mathbf{e}_z$ direction); i.e., $g_T = 2$.

(a) Phonons of both types, longitudinal and transverse, are bosons. Why? 

(b) Phonons are fairly easily created, absorbed, scattered and thermalized. A general
argument regarding chemical reactions (Sec. 4.4) can be applied to phonon creation
and absorption to deduce that, once they reach complete thermal equilibrium with
their environment, the phonons will have vanishing chemical potential, $\mu = 0$, and some
temperature $T$. What, then, will be their distribution functions $\eta$ and $\mathcal{N}$?

(c) Ignoring the fact that the sound waves' wavelengths $\lambda = 2\pi/|\mathbf{k}|$ cannot be smaller than
about twice the spacing between the atoms of the solid, show that the total phonon
energy (wave energy) in a volume $V$ of the solid is identical to that for black-body
photons in a volume $V$, but with the speed of light $c$ replaced by the speed of sound $c_s$,
and with the photons' number of spin states, 2, replaced by $g_s$ (2 for transverse waves,
1 for longitudinal): $\mathcal{E}_{\rm total} = a_s T^4 V$, with $a_s = g_s (4\pi^5/15)(k^4/h^3 c_s^3)$; cf. Eqs. (2.40).

(d) Show that the specific heat of the phonon gas (the sound waves) is $C_V = 4 a_s T^3 V$. This
scales as $T^3$, whereas in a metal the specific heat of the degenerate electrons scales as
$T$ [previous exercise], so at sufficiently low temperatures the electron specific heat will
dominate over that of the phonons.

(e) Show that in the phonon gas, only phonon modes with wavelengths longer than
$\sim \lambda_T = c_s h/kT$ are excited; i.e., for $\lambda \ll \lambda_T$ the mean occupation number is $\eta \ll 1$; for
$\lambda \sim \lambda_T$, $\eta \sim 1$; and for $\lambda \gg \lambda_T$, $\eta \gg 1$. As $T$ is increased, $\lambda_T$ gets reduced. Ultimately
it becomes of order the interatomic spacing, and our computation fails because most
of the modes that our calculation assumes are thermalized actually don't exist. What
is the critical temperature (Debye temperature) at which our computation fails and
the $T^3$ law for $C_V$ changes? Show by a roughly one-line argument that above the Debye
temperature $C_V$ is independent of temperature.
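
The three wavelength regimes in the last part of this exercise can be illustrated numerically. The sketch below assumes the Bose-Einstein occupation number with vanishing chemical potential, $\eta = 1/(e^{\mathcal{E}/kT} - 1)$, together with $\mathcal{E} = h c_s/\lambda$, so that $\mathcal{E}/kT = \lambda_T/\lambda$; the function name is a hypothetical helper.

```python
import math


def phonon_occupation(wavelength_over_lambda_T):
    """Bose-Einstein occupation with mu = 0, using E/kT = lambda_T / lambda."""
    return 1.0 / math.expm1(1.0 / wavelength_over_lambda_T)


for ratio in (0.1, 1.0, 10.0):   # lambda / lambda_T
    print(f"lambda/lambda_T = {ratio:5.1f}:  eta = {phonon_occupation(ratio):.4g}")
```

With these assumptions the occupation number is of order $10^{-5}$ at $\lambda = 0.1\lambda_T$, about 0.6 at $\lambda = \lambda_T$, and about 10 at $\lambda = 10\lambda_T$.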

2.5 Evolution of the Distribution Function: Liouville's 
Theorem, the Collisionless Boltzmann Equation, 
and the Boltzmann Transport Equation 

We now turn to the issue of how the distribution function $\eta(\mathcal{P}, \vec{p})$, or equivalently
$\mathcal{N} = (g_s/h^3)\eta$, evolves from point to point in phase space. We shall explore the evolution under
the simple assumption that between their very brief collisions, the particles all move freely,
uninfluenced by any forces. It is straightforward to generalize to a situation where the
particles interact with electromagnetic or gravitational or other fields as they move, and we
shall do so in Box 2.1, Sec. 3.3, and Chap. 27. However, in the body of this chapter we shall
restrict attention to the very common situation of free motion between collisions.

Initially we shall even rule out collisions; only at the end of this section will we restore
them, by inserting them as an additional term in our collision-free evolution equation for $\eta$.

The foundation for our collision-free evolution law will be Liouville's Theorem: Consider
a collection $Q$ of particles which are initially all near some location $(\mathcal{P}_0, \vec{p}_0)$ in phase space
and initially occupy an infinitesimal (frame-independent) phase-space volume $d\mathcal{V}_x d\mathcal{V}_p$ there.
Pick a particle at the center of the collection $Q$ and call it the "fiducial particle". Since
all the particles in $Q$ have nearly the same initial position $\mathcal{P}_0$ and 4-velocity $\vec{u} = \vec{p}_0/m$,
they subsequently all move along nearly the same world line through spacetime; i.e., they all
remain congregated around the fiducial particle. We thus can regard their volume $d\mathcal{V}_x d\mathcal{V}_p$ in
phase space as a function of the fiducial particle's affine parameter $\zeta$ (so normalized that the
particle's straight world line is $\mathcal{P}(\zeta)$ and its conserved 4-momentum is $\vec{p}_0 = d\mathcal{P}/d\zeta$). Liouville's
theorem says that the phase-space volume occupied by the particles $Q$ is conserved,

$$\frac{d}{d\zeta}\left(d\mathcal{V}_x\,d\mathcal{V}_p\right) = 0 ; \tag{2.55}$$

i.e., it is a constant along the world line of the fiducial particle.

We can prove Liouville's theorem with the aid of the diagrams in Fig. 2.7. Assume,
for simplicity, that the particles have nonzero rest mass, and switch from $\zeta$ to the fiducial
particle's proper time $\tau = m\zeta$ as the parameter with respect to which changes are monitored.
Consider the region in phase space occupied by the particles, as seen in the Lorentz frame of
the fiducial particle (so Lorentz coordinate time $t$ is equal to $\tau$). Choose that region to be a
rectangular box at the initial event $\mathcal{P}_0$ [Fig. 2.7(a)], and set $t = \tau = 0$ at that initial event.
Examine the evolution with time $t$ of the 2-dimensional slice $y = p_y = z = p_z = 0$ through
the occupied region. The evolution of other slices will be similar. Then, as $t = \tau$ passes,
the particle at location $(x, p_x)$ moves with velocity $dx/dt = p_x/m$, where the nonrelativistic
approximation to the velocity is used because all the particles are very nearly at rest in
the fiducial particle's Lorentz frame. Because the particles move freely, each one has a
conserved $p_x$, and their motion $dx/dt = p_x/m$ (larger speed higher in the diagram) deforms
the particles' phase space region as shown in Fig. 2.7(b). Obviously, the area of the occupied
region, $\Delta x\,\Delta p_x$, is conserved.

Fig. 2.7: The phase space region ($x$-$p_x$ part) occupied by a collection of particles with finite rest
mass, as seen in the Lorentz frame of the central, fiducial particle. (a) The initial region. (b) The
region after a short time.

This same argument shows that the $x$-$p_x$ area is conserved at all values of $y, z, p_y, p_z$; and
similarly for the areas in the $y$-$p_y$ planes and the areas in the $z$-$p_z$ planes. As a consequence,
the total volume in phase space, $d\mathcal{V}_x d\mathcal{V}_p = \Delta x\,\Delta p_x\,\Delta y\,\Delta p_y\,\Delta z\,\Delta p_z$, is conserved.
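
The area-preserving shear used in this proof is easy to reproduce numerically. The following sketch, with hypothetical helper names and arbitrary illustrative numbers for the box, mass, and elapsed time, moves the corners of a rectangular $x$-$p_x$ region under free streaming and compares areas using the shoelace formula.

```python
def polygon_area(vertices):
    """Shoelace formula for the area of a simple polygon given as (x, p) pairs."""
    area = 0.0
    n = len(vertices)
    for i in range(n):
        x1, p1 = vertices[i]
        x2, p2 = vertices[(i + 1) % n]
        area += x1 * p2 - x2 * p1
    return abs(area) / 2.0


def free_stream(vertices, mass, time):
    """Free motion in phase space: x -> x + (p/m) t, with p conserved."""
    return [(x + p / mass * time, p) for x, p in vertices]


# Initial rectangular region centered on the fiducial particle (illustrative values).
box = [(-1.0, -0.5), (1.0, -0.5), (1.0, 0.5), (-1.0, 0.5)]
sheared = free_stream(box, mass=2.0, time=3.0)

print("initial area:", polygon_area(box))
print("sheared area:", polygon_area(sheared))
```

Because straight edges map to straight edges under this linear map, following the four corners is enough: the rectangle becomes a parallelogram with the same base and height, so both areas come out equal.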

Although this proof of Liouville's theorem relied on the assumption that the particles 
have nonzero rest mass, the theorem is also true for particles with zero rest mass — as one 
can deduce by taking the limit as the rest mass goes to zero and the particles' 4-momenta 
become null. 

Since, in the absence of collisions or other nongravitational interactions, the number $dN$
of particles in the collection $Q$ will be conserved, Liouville's theorem immediately implies
also the conservation of the number density in phase space, $\mathcal{N} = dN/d\mathcal{V}_x d\mathcal{V}_p$:

$$\frac{d\mathcal{N}}{d\zeta} = 0 \quad \text{along the trajectory of a fiducial particle.} \tag{2.56}$$

This conservation law is called the collisionless Boltzmann equation; and in the context of
plasma physics it is sometimes called the Vlasov equation. Note that it says that not only
is the distribution function $\mathcal{N}$ frame-independent; $\mathcal{N}$ also is constant along the phase-space
trajectory of any freely moving particle.

The collisionless Boltzmann equation is actually far more general than is suggested by 
the above derivation; see Box 2.1, which is best read after finishing this section. 

The collisionless Boltzmann equation is most nicely expressed in the frame-independent
form (2.56). For some purposes, however, it is helpful to express the equation in a form
that relies on a specific but arbitrary choice of Lorentz reference frame. Then $\mathcal{N}(\mathcal{P}, \vec{p})$ can
be regarded as a function of the seven phase-space coordinates, $\mathcal{N} = \mathcal{N}(x^\mu, p_j)$, and the
collisionless Boltzmann equation (2.56) then takes the coordinate-dependent form

$$\frac{d\mathcal{N}}{d\zeta} = \frac{dx^\mu}{d\zeta}\frac{\partial\mathcal{N}}{\partial x^\mu} + \frac{dp_j}{d\zeta}\frac{\partial\mathcal{N}}{\partial p_j} = \mathcal{E}\left(\frac{\partial\mathcal{N}}{\partial t} + v_j\frac{\partial\mathcal{N}}{\partial x^j}\right) = 0 . \tag{2.57}$$

Here we have used the equations of motion for the fiducial particle: $dx^0/d\zeta = p^0 = \mathcal{E} =
\sqrt{m^2 + |\mathbf{p}|^2}$, $dx^j/d\zeta = p^j = p^0 v_j$, and $dp_j/d\zeta = 0$.
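
The content of Eq. (2.57) is that $\mathcal{N}$ is simply carried along each free trajectory. As an illustration, the sketch below restricts to one spatial dimension, picks an arbitrary smooth initial profile (purely hypothetical), builds the free-streaming solution $\mathcal{N}(t, x, p) = \mathcal{N}_0(x - vt, p)$, and checks by central differences that $\partial\mathcal{N}/\partial t + v\,\partial\mathcal{N}/\partial x$ vanishes.

```python
import math


def initial_profile(x, p):
    """Hypothetical smooth initial distribution N(t = 0, x, p)."""
    return math.exp(-x * x) * math.exp(-((p - 1.0) ** 2))


def velocity(p, mass):
    """Velocity v = p / E with E = sqrt(m^2 + p^2), in units with c = 1."""
    return p / math.sqrt(mass * mass + p * p)


def distribution(t, x, p, mass=1.0):
    """Free-streaming solution: N is constant along x = x0 + v t at fixed p."""
    return initial_profile(x - velocity(p, mass) * t, p)


def boltzmann_residual(t, x, p, mass=1.0, step=1e-5):
    """Central-difference estimate of dN/dt + v dN/dx, which should vanish."""
    dN_dt = (distribution(t + step, x, p, mass)
             - distribution(t - step, x, p, mass)) / (2.0 * step)
    dN_dx = (distribution(t, x + step, p, mass)
             - distribution(t, x - step, p, mass)) / (2.0 * step)
    return dN_dt + velocity(p, mass) * dN_dx


for point in [(0.0, 0.3, 0.5), (1.0, -0.2, 2.0), (2.5, 1.0, -1.0)]:
    print(point, boltzmann_residual(*point))
```

The residuals are zero up to finite-difference and rounding error, for any choice of the initial profile.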

Since our derivation of the collisionless Boltzmann equation relied on the assumption that
no particles are created or destroyed as time passes, the collisionless Boltzmann equation in
turn should guarantee conservation of the number of particles, $\vec{\nabla}\cdot\vec{S} = 0$ [cf. Eq. (2.33)].
Indeed, this is so; see Ex. 2.10. Similarly, since the collisionless Boltzmann equation is based
on the law of 4-momentum conservation for all the individual particles, it is reasonable to
expect that the collisionless Boltzmann equation will guarantee the conservation of their
total 4-momentum, i.e. will guarantee that $\vec{\nabla}\cdot\mathbf{T} = 0$. Indeed, this conservation law does
follow from the collisionless Boltzmann equation; see Ex. 2.10.

Thus far we have assumed that the particles move freely through phase space with no
collisions. If collisions occur, they will produce some nonconservation of $\mathcal{N}$ along the
trajectory of a freely moving, noncolliding fiducial particle, and correspondingly, the collisionless
Boltzmann equation will get modified to read

$$\frac{d\mathcal{N}}{d\zeta} = \left(\frac{d\mathcal{N}}{d\zeta}\right)_{\rm collisions} , \tag{2.58}$$


where the right-hand side represents the effects of collisions. This equation, with collision 
terms present, is called the Boltzmann transport equation. The actual form of the collision 
terms depends, of course, on the details of the collisions. We shall meet some specific 
examples in the next section [Eqs. (2.67), (2.72a), and Ex. 2.14] and in our study of plasmas 
(Chaps. 21 and 22). 

Whenever one applies the collisionless Boltzmann or Boltzmann transport equation to a
given situation, it is helpful to simplify one's thinking in two ways: (i) Adjust the
normalization of the distribution function so it is naturally tuned to the situation. For example, when
dealing with photons, $I_\nu/\nu^3$ is typically best, and if (as is usually the case) the photons
do not change their frequencies as they move and only a single reference frame is of any
importance, then $I_\nu$ alone may do; see Ex. 2.11. (ii) Adjust the differentiation parameter ($\zeta$
above) so it is also naturally tuned to the situation. For example, instead of $\zeta$, one might
use the distance $l$ or time $t$ traveled in some preferred inertial reference frame.



EXERCISES 



Exercise 2.10 Derivation and Problem: Collisionless Boltzmann Implies Conservation of
Particles and of 4-Momentum

Consider a collection of freely moving, noncolliding particles, which satisfy the collisionless
Boltzmann equation $d\mathcal{N}/d\zeta = 0$. Show that this equation guarantees that the conservation
laws $\vec{\nabla}\cdot\vec{S} = 0$ and $\vec{\nabla}\cdot\mathbf{T} = 0$ are satisfied, where the number-flux vector $\vec{S}$ and the
stress-energy tensor $\mathbf{T}$ are expressed in terms of $\mathcal{N}$ by the momentum-space integrals (2.30b) and
(2.30c).

Exercise 2.11 Problem: Solar Heating of the Earth: The Greenhouse Effect

Box 2.1

Sophisticated Derivation of Collisionless Boltzmann Equation

Denote by $\vec{X} \equiv \{\mathcal{P}, \vec{p}\}$ a point in 8-dimensional phase space. In a Lorentz frame
the coordinates of $\vec{X}$ can be taken to be $\{x^0, x^1, x^2, x^3, p_0, p_1, p_2, p_3\}$. [We use up indices
("contravariant" indices) on $x$ and down indices ("covariant" indices) on $p$ because this
is the form required in Hamilton's equations below; i.e., it is $p_\alpha$ not $p^\alpha$ that is canonically
conjugate to $x^\alpha$.] Regard $\mathcal{N}$ as a function of location $\vec{X}$ in 8-dimensional phase space.
The fact that our particles all have the same rest mass so $\mathcal{N}$ is nonzero only on the mass
hyperboloid means that as a function of $\vec{X}$, $\mathcal{N}$ entails a delta function. For the following
derivation that delta function is irrelevant; the derivation is valid also for distributions
of non-identical particles, as treated in Ex. 2.1.

A particle in our distribution at location $\vec{X}$ moves through phase space along a world
line with tangent vector $d\vec{X}/d\zeta$, where $\zeta$ is its affine parameter. The product $\mathcal{N}\,d\vec{X}/d\zeta$
represents the number-flux 8-vector of particles through spacetime, as one can see by
an argument analogous to Eqs. (2.32). We presume that, as the particles move through
phase space, none are created or destroyed. The law of particle conservation in phase
space, by analogy with $\vec{\nabla}\cdot\vec{S} = 0$ in spacetime, takes the form $\vec{\nabla}\cdot(\mathcal{N}\,d\vec{X}/d\zeta) = 0$. In
terms of coordinates in a Lorentz frame, this conservation law says

$$\frac{\partial}{\partial x^\alpha}\left(\mathcal{N}\frac{dx^\alpha}{d\zeta}\right) + \frac{\partial}{\partial p_\alpha}\left(\mathcal{N}\frac{dp_\alpha}{d\zeta}\right) = 0 . \tag{1}$$

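The motions of individual particles in phase space are governed by Hamilton's equations

$$\frac{dx^\alpha}{d\zeta} = \frac{\partial\mathcal{H}}{\partial p_\alpha} , \qquad \frac{dp_\alpha}{d\zeta} = -\frac{\partial\mathcal{H}}{\partial x^\alpha} . \tag{2}$$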



For the freely moving particles of this chapter, the relativistic Hamiltonian is [cf. Eq.
(8.62) of Goldstein (1980) and pp. 488-489 of Misner, Thorne and Wheeler (1973)]

$$\mathcal{H} = \frac{1}{2}\left(p_\alpha p_\beta\,g^{\alpha\beta} - m^2\right) . \tag{3}$$

Our derivation of the collisionless Boltzmann equation does not depend on this specific
form of the Hamiltonian; it is valid for any Hamiltonian and thus, e.g., for particles
interacting with an electromagnetic field or even a relativistic gravitational field (spacetime
curvature; Part VI). By inserting Hamilton's equations (2) into the 8-dimensional law of
particle conservation (1), we obtain

$$\frac{\partial}{\partial x^\alpha}\left(\mathcal{N}\frac{\partial\mathcal{H}}{\partial p_\alpha}\right) - \frac{\partial}{\partial p_\alpha}\left(\mathcal{N}\frac{\partial\mathcal{H}}{\partial x^\alpha}\right) = 0 . \tag{4}$$

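Using the rule for differentiating products, and noting that the terms involving two
derivatives of $\mathcal{H}$ cancel, we bring this into the form

$$0 = \frac{\partial\mathcal{N}}{\partial x^\alpha}\frac{\partial\mathcal{H}}{\partial p_\alpha} - \frac{\partial\mathcal{N}}{\partial p_\alpha}\frac{\partial\mathcal{H}}{\partial x^\alpha} = \frac{\partial\mathcal{N}}{\partial x^\alpha}\frac{dx^\alpha}{d\zeta} + \frac{\partial\mathcal{N}}{\partial p_\alpha}\frac{dp_\alpha}{d\zeta} = \frac{d\mathcal{N}}{d\zeta} , \tag{5}$$
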

which is the collisionless Boltzmann equation. [To get the second expression we have 
used Hamilton's equations, and the third follows directly from the formulas of differential 
calculus; cf. Eq. (2.57).] Thus, the collisionless Boltzmann equation is a consequence of 
just two assumptions, conservation of particles and Hamilton's equations for the motion 
of each particle, which implies it has very great generality. We shall extend and explore 
this generality in the next chapter. 
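
The cancellation invoked in Box 2.1 can be written out explicitly. The following is a sketch of that intermediate step, using only the product rule, the equality of mixed partial derivatives of $\mathcal{H}$, and Hamilton's equations as stated in the box; the symbols are those of the box.

```latex
\begin{aligned}
0 &= \frac{\partial}{\partial x^\alpha}\!\left(\mathcal{N}\,\frac{\partial\mathcal{H}}{\partial p_\alpha}\right)
   - \frac{\partial}{\partial p_\alpha}\!\left(\mathcal{N}\,\frac{\partial\mathcal{H}}{\partial x^\alpha}\right) \\
  &= \frac{\partial\mathcal{N}}{\partial x^\alpha}\frac{\partial\mathcal{H}}{\partial p_\alpha}
   + \mathcal{N}\,\frac{\partial^2\mathcal{H}}{\partial x^\alpha\,\partial p_\alpha}
   - \frac{\partial\mathcal{N}}{\partial p_\alpha}\frac{\partial\mathcal{H}}{\partial x^\alpha}
   - \mathcal{N}\,\frac{\partial^2\mathcal{H}}{\partial p_\alpha\,\partial x^\alpha} \\
  &= \frac{\partial\mathcal{N}}{\partial x^\alpha}\frac{\partial\mathcal{H}}{\partial p_\alpha}
   - \frac{\partial\mathcal{N}}{\partial p_\alpha}\frac{\partial\mathcal{H}}{\partial x^\alpha}
   \qquad \text{(the mixed second derivatives of } \mathcal{H} \text{ are equal and cancel)}
\end{aligned}
```

Substituting $\partial\mathcal{H}/\partial p_\alpha = dx^\alpha/d\zeta$ and $-\partial\mathcal{H}/\partial x^\alpha = dp_\alpha/d\zeta$ then gives the chain-rule expression for $d\mathcal{N}/d\zeta$.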



In this example we shall study the heating of the Earth by the Sun. Along the way, we shall 
derive some important relations for black-body radiation. 

Since we will study photon propagation from the Sun to the Earth with Doppler shifts
playing a negligible role, there is a preferred inertial reference frame: that of the Sun and
Earth with their relative motion neglected. We shall carry out our analysis in that frame.
Since we are dealing with thermalized photons, the natural choice for the distribution
function is $I_\nu/\nu^3$; and since we use just one unique reference frame, each photon has a fixed
frequency $\nu$, so we can forget about the $\nu^3$ and use $I_\nu$.

(a) Assume, as is very nearly true, that each spot on the Sun emits black-body radiation in
all outward directions with a temperature $T_\odot = 5800$ K. Show, by integrating over the
black-body $I_\nu$, that the total energy flux (i.e. power per unit surface area) $F$ emitted
by the Sun is

$$F \equiv \frac{d\mathcal{E}}{dt\,dA} = \sigma T_\odot^4 , \quad \text{where } \sigma = \frac{ac}{4} = \frac{2\pi^5}{15}\frac{k^4}{h^3 c^2} = 5.67\times10^{-5}\ \frac{\rm erg}{\rm cm^2\,s\,K^4} . \tag{2.59}$$

(b) Since the distribution function $I_\nu$ is conserved along each photon's trajectory, observers
on Earth, looking at the Sun, see identically the same black-body specific intensity $I_\nu$
as they would if they were on the Sun's surface. (No wonder our eyes hurt if we look
directly at the Sun!) By integrating over $I_\nu$ at the Earth [and not by the simpler
method of using Eq. (2.59) for the flux leaving the Sun], show that the energy flux
arriving at Earth is $F = \sigma T_\odot^4 (R_\odot/r)^2$, where $R_\odot = 696{,}000$ km is the Sun's radius and
$r = 1.496 \times 10^8$ km is the distance from the Sun to Earth.

(c) Our goal is to compute the temperature $T_\oplus$ of the Earth's surface. As a first attempt,
assume that all the Sun's flux arriving at Earth is absorbed by the Earth's surface,
heating it to the temperature $T_\oplus$, and then is reradiated into space as black-body
radiation at temperature $T_\oplus$. Show that this leads to a surface temperature of

$$T_\oplus = T_\odot\left(\frac{R_\odot}{2r}\right)^{1/2} = 280\ {\rm K} = 7\ {\rm C} . \tag{2.60}$$

This is a bit cooler than the correct mean surface temperature (287 K = 14 C).

(d) Actually, the Earth has an "albedo" of $A \simeq 0.30$, which means that 30 per cent of the
sunlight that falls onto it gets reflected back into space with an essentially unchanged
spectrum, rather than being absorbed. Show that with only a fraction $1 - A = 0.70$
of the solar radiation being absorbed, the above estimate of the Earth's temperature
becomes

$$T_\oplus = T_\odot\left(\frac{\sqrt{1-A}\,R_\odot}{2r}\right)^{1/2} = 255\ {\rm K} = -18\ {\rm C} . \tag{2.61}$$

This is even farther from the correct answer.

(e) The missing piece of physics, which raises the temperature from $-18$ C to something
much nearer the correct 14 C, is the Greenhouse Effect: The absorbed solar radiation
has most of its energy at wavelengths $\sim 0.5\,\mu$m (in the visual band), which pass rather
easily through the Earth's atmosphere. By contrast, the black-body radiation that
the Earth's surface wants to radiate back into space, with its temperature $\sim 300$ K,
is concentrated in the infrared range from $\sim 8\,\mu$m to $\sim 30\,\mu$m. Water molecules and
carbon dioxide in the Earth's atmosphere absorb about half of the energy that the Earth
tries to reradiate at these energies (see footnote 5), causing the reradiated energy to be
about half that of a black body at the Earth's surface temperature. Show that with this
"Greenhouse" correction, $T_\oplus$ becomes about 293 K = +20 C. Of course, the worry is that human
activity will increase the amount of carbon dioxide in the atmosphere by enough to
raise the Earth's temperature still further, thereby disrupting our comfortable lives.
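
The numbers quoted in parts (a) through (d) of this exercise follow from a few lines of arithmetic. The sketch below assumes rounded cgs constants and the solar values given in the exercise; the function name `earth_temperature` is a hypothetical helper, and the energy balance it encodes (absorption over the Earth's cross section $\pi R^2$, emission over its full surface $4\pi R^2$) is the standard argument rather than anything specific to this text. It deliberately stops short of the greenhouse correction of part (e).

```python
import math

# Physical constants in cgs units (rounded CODATA values; assumed here).
H = 6.62607015e-27      # Planck constant [erg s]
K_B = 1.380649e-16      # Boltzmann constant [erg/K]
C = 2.99792458e10       # speed of light [cm/s]

# Stefan-Boltzmann constant from Eq. (2.59) [erg / (cm^2 s K^4)].
SIGMA = 2.0 * math.pi**5 * K_B**4 / (15.0 * H**3 * C**2)

T_SUN = 5800.0          # solar surface temperature [K], as in part (a)
R_SUN_KM = 696000.0     # solar radius [km]
DISTANCE_KM = 1.496e8   # Sun-Earth distance [km]


def earth_temperature(albedo=0.0):
    """Equilibrium surface temperature [K] for the given albedo, Eqs. (2.60)-(2.61)."""
    # Balance: (1 - A) * pi R^2 * F_arriving = 4 pi R^2 * sigma * T^4.
    return T_SUN * (1.0 - albedo) ** 0.25 * math.sqrt(R_SUN_KM / (2.0 * DISTANCE_KM))


flux_at_earth = SIGMA * T_SUN**4 * (R_SUN_KM / DISTANCE_KM) ** 2

print(f"sigma          = {SIGMA:.3e} erg/(cm^2 s K^4)")
print(f"flux at Earth  = {flux_at_earth:.3e} erg/(cm^2 s)")
print(f"T (A = 0)      = {earth_temperature(0.0):.1f} K")
print(f"T (A = 0.30)   = {earth_temperature(0.30):.1f} K")
```

With these inputs the two temperatures come out near 280 K and 256 K, matching Eqs. (2.60) and (2.61) to within rounding.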

Exercise 2.12 Challenge: Olbers' Paradox and Solar Furnace

Consider a universe in which spacetime is flat and is populated throughout by stars that 
cluster into galaxies like our own and our neighbors, with interstellar and intergalactic dis- 
tances similar to those in our neighborhood. Assume that the galaxies are not moving apart, 
i.e., there is no universal expansion. Using the collisionless Boltzmann equation for photons, 
show that the Earth's temperature in this universe would be about the same as the surface 
temperatures of the universe's hotter stars, $\sim 10{,}000$ K, so we would all be fried. What
features of our universe protect us from this fate? 

Motivated by this model universe, describe a design for a furnace that relies on sunlight 
for its heat and achieves a temperature nearly equal to that of the sun's surface, 5770 K. 

2.6 Transport Coefficients 

In this section we turn to a practical application of kinetic theory: the computation of 
transport coefficients. Our primary objective is to illustrate the use of kinetic theory, but 
the transport coefficients themselves are also of interest: they will play an important role in 
Parts IV and V of this book (Fluid Mechanics and Plasma Physics). 

5 See, e.g., the section and figures on "Absorption of Atmospheric Gases" in Allen (2000).

What are transport coefficients? An example is the electrical conductivity $\sigma$ of a sample of
matter. Ohm's law says that, as measured in the sample's proper reference frame, the current
density $\mathbf{J}$ is directly proportional to the electric field $\mathbf{E}$, with proportionality constant $\sigma$; i.e.,
$\mathbf{J} = \sigma\mathbf{E}$. The electrical conductivity is high if electrons can move through the material with
ease; it is low if electrons have difficulty moving. The impediment to the electron motion
is scattering off other particles: off ions, other electrons, phonons (sound waves), plasmons
(plasma waves), .... In order to compute the electrical conductivity, one must analyze the
effects of those scatterings on the motion of the electrons; and since huge numbers of electrons
and scatterers are involved, the analysis must be statistical. The foundation for an accurate
analysis is the Boltzmann transport equation.

Another example of a transport coefficient is the thermal conductivity $\kappa$, which enters into
the law of heat conduction, $\mathbf{q} = -\kappa\nabla T$, where $\mathbf{q}$ is the energy flux and $T$ is the temperature
as measured in the mean rest frame of the matter. The impediment to heat conduction
is the scattering of the conducting particles off other particles; and, correspondingly, the
foundation for accurately computing the thermal conductivity $\kappa$ is the Boltzmann transport
equation.

Other examples of transport coefficients are the coefficient of shear viscosity $\eta_{\rm shear}$, which
enters into the Navier-Stokes equation of fluid mechanics (Part IV of this book, where it is
denoted $\eta$), and which describes the ability of a fluid to transport momentum in directions
transverse to its flow; and the diffusion coefficient $D$, which enters into the diffusion equation
(Ex. 2.13 below) and describes the ability of particles to diffuse through matter.

The physical laws into which these transport coefficients enter (Ohm's law, the law of
heat conduction, the Navier-Stokes equation, the diffusion equation) are approximations to
the real world, valid if and only if (i) many particles are involved
in the transport of the quantity of interest (charge, heat, momentum, particles) and (ii) on
average each particle undergoes many scatterings in moving over the length scale of the
macroscopic inhomogeneities that drive the transport. This second requirement for validity
can be expressed quantitatively in terms of the mean free path $\lambda$ between scatterings (i.e.,
the mean distance a particle travels between scatterings, as measured in the mean rest frame
of the matter) and the macroscopic inhomogeneity scale $\mathcal{L}$ for the quantity that drives the
transport (for example, in heat transport that scale is $\mathcal{L} \sim T/|\nabla T|$, i.e., it is the scale on
which the temperature changes by an amount of order itself). In terms of these quantities,
the second criterion of validity is that $\lambda \ll \mathcal{L}$. These two criteria (many particles and
$\lambda \ll \mathcal{L}$) together are called diffusion criteria, since they guarantee that the quantity being
transported (charge, heat, momentum, particles) will diffuse through the matter. If either
of the two diffusion criteria fails, then the standard transport law (Ohm's law, the law of
heat conduction, the Navier-Stokes equation, or the diffusion equation) breaks down and the
corresponding transport coefficient becomes irrelevant and meaningless.

The accuracy with which one can compute a transport coefficient using the Boltzmann 
transport equation depends on the accuracy of one's description of the scattering. If one 
uses a high-accuracy collision term (dj\f/d() collisions i n the Boltzmann equation, one can 
derive a highly accurate transport coefficient. If one uses a very crude approximation for the 
collision term, one's resulting transport coefficient might be accurate only to within an order 
of magnitude — in which case, it was probably not worth the effort to use the Boltzmann 
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equation; a simple order-of-magnitude argument would have done just as well. 

In this section we shall compute the coefficient of thermal conductivity $\kappa$ first by an 
order-of-magnitude argument, and then by the Boltzmann equation with a highly accurate collision 
term. In Ex. 2.13 readers will have the opportunity to compute the diffusion coefficient using 
a moderately accurate collision term. 

2.6.1 Problem to be Analyzed: Diffusive Heat Conduction Inside 
a Star 

The specific problem we shall treat here in the text is heat transport through hot gas deep 
inside a young, massive star. We shall confine attention to that portion of the star in 
which the temperature is $10^7\,{\rm K} \lesssim T \lesssim 10^9\,{\rm K}$, the mass density is 
$\rho \lesssim 10\,{\rm g/cm^3}\,(T/10^7\,{\rm K})^2$, 
and heat is carried primarily by diffusing photons rather than by diffusing electrons or ions 
or by convection. (We shall study convection in Chap. 17.) In this regime the primary 
impediment to the photons' flow is collisions with electrons. The lower limit on temperature, 
$10^7\,{\rm K} \lesssim T$, guarantees that the gas is almost fully ionized, so there is a plethora of electrons 
to do the scattering. The upper limit on density, $\rho \lesssim 10\,{\rm g/cm^3}\,(T/10^7\,{\rm K})^2$, guarantees that 
(i) the scattering electrons are nondegenerate, i.e., they have mean occupation numbers $\eta$ 
small compared to unity and thus behave like classical, free, charged particles; and (ii) the 
inelastic scattering, absorption, and emission of photons by electrons that are accelerating 
in the Coulomb fields of ions ("bremsstrahlung" processes) are unimportant as impediments 
to heat flow compared to scattering off free electrons. The upper limit on temperature, 
$T \lesssim 10^9\,{\rm K}$, guarantees that (i) the electrons which do the scattering are moving thermally 
at much less than the speed of light (the mean thermal energy $1.5\,kT$ of an electron is much 
less than its rest mass-energy $m_e c^2$); and (ii) the scattering is nearly elastic, with negligible 
energy exchange between photon and electron, and is describable with good accuracy by the 
Thomson scattering cross section: 

In the rest frame of the electron, which to good accuracy will be the same as the mean 
rest frame of the gas since the electron's speed relative to the mean rest frame is $\ll c$, the 
differential cross section $d\sigma$ for a photon to scatter from its initial propagation direction $\mathbf{n}'$ 
into a unit solid angle $d\Omega$ centered on a new propagation direction $\mathbf{n}$ is 

$$ \frac{d\sigma(\mathbf{n}' \to \mathbf{n})}{d\Omega} = \frac{3}{16\pi}\,\sigma_T \left[ 1 + (\mathbf{n}\cdot\mathbf{n}')^2 \right] . \tag{2.62a} $$

Here $\sigma_T$ is the total Thomson cross section [the integral of the differential cross section 
(2.62a) over solid angle] 

$$ \sigma_T = \int \frac{d\sigma(\mathbf{n}' \to \mathbf{n})}{d\Omega}\, d\Omega = \frac{8\pi}{3}\, r_0^2 = 0.665 \times 10^{-24}\ {\rm cm}^2 , \tag{2.62b} $$

where $r_0 = e^2/m_e c^2$ is the classical electron radius. For a derivation and discussion of the 
differential Thomson cross section (2.62a) and the total Thomson cross section (2.62b) see, 
e.g., Sec. 14.8 of Jackson (1999). 
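The consistency of the differential and total Thomson cross sections quoted above can be checked numerically. The following is a minimal sketch, not part of the original text: the cgs values of the electron charge, electron mass, and speed of light are standard constants supplied here as assumptions, and the function names are purely illustrative.

```python
import math

# Standard cgs (Gaussian) constants, supplied here as assumptions.
E_CHARGE = 4.8032e-10     # electron charge, esu
M_ELECTRON = 9.1094e-28   # electron mass, g
C_LIGHT = 2.9979e10       # speed of light, cm/s

# Classical electron radius r_0 = e^2 / (m_e c^2) and total Thomson cross section.
r0 = E_CHARGE**2 / (M_ELECTRON * C_LIGHT**2)   # cm
sigma_T = (8.0 * math.pi / 3.0) * r0**2        # cm^2


def dsigma_dOmega(mu, sigma_total):
    """Differential Thomson cross section as a function of mu = n . n'."""
    return 3.0 / (16.0 * math.pi) * sigma_total * (1.0 + mu**2)


def integrate_over_solid_angle(sigma_total, n_steps=20000):
    """Midpoint rule in mu = cos(theta); the azimuthal integral contributes 2*pi."""
    dmu = 2.0 / n_steps
    total = 0.0
    for i in range(n_steps):
        mu = -1.0 + (i + 0.5) * dmu
        total += dsigma_dOmega(mu, sigma_total) * dmu
    return 2.0 * math.pi * total


sigma_check = integrate_over_solid_angle(sigma_T)
print(f"r_0     = {r0:.4e} cm")
print(f"sigma_T = {sigma_T:.4e} cm^2")
print(f"integral of dsigma/dOmega divided by sigma_T = {sigma_check / sigma_T:.6f}")
```

The ratio printed on the last line should be very close to unity, confirming that the prefactor $3/16\pi$ in (2.62a) is the one that normalizes the angular integral to $\sigma_T$.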



Fig. 2.8: Heat exchange between two layers of gas separated by a distance of one photon mean 
free path in the direction of the gas's temperature gradient. (Figure labels: layers at $z = 0$ 
and $z = \lambda$, carrying energy fluxes $\alpha c a T_0^4$ and $\alpha c a T_\lambda^4$.) 

2.6.2 Order-of-Magnitude Analysis 

Before embarking on any complicated calculation, it is always helpful to do a rough, 
order-of-magnitude analysis, thereby identifying the key physics and the approximate answer. The 
first step of a rough analysis of our heat transport problem is to identify the magnitudes of 
the relevant lengthscales. The inhomogeneity scale $\mathcal{L}$ for the temperature, which drives the 
heat flow, is the size of the hot stellar core, a moderate fraction of the Sun's radius: 
$\mathcal{L} \sim 10^5$ km. The mean free path of a photon can be estimated by noting that, since each electron 
presents a cross section $\sigma_T$ to the photon and there are $n_e$ electrons per unit volume, the 
probability of a photon being scattered when it travels a distance $\ell$ through the gas is of 
order $n_e \sigma_T \ell$; and therefore to build up to unit probability for scattering, the photon must 
travel a distance 

$$ \lambda \sim \frac{1}{n_e \sigma_T} \sim \frac{m_p}{\rho\,\sigma_T} \sim 3\ {\rm cm} \left( \frac{1\ {\rm g/cm^3}}{\rho} \right) \sim 3\ {\rm cm} . \tag{2.63} $$

Here $m_p$ is the proton rest mass, $\rho \sim 1\ {\rm g/cm^3}$ is the mass density in the core of a young, 
massive star, and we have used the fact that stellar gas is mostly hydrogen to infer that there 
is approximately one nucleon per electron in the gas and hence that $n_e \sim \rho/m_p$. Note that 
$\mathcal{L} \sim 10^5$ km is roughly $3 \times 10^9$ times larger than $\lambda \sim 3$ cm, and the number of electrons and 
photons inside a cube of side $\mathcal{L}$ is enormous, so the diffusion description of heat transport is 
quite accurate. 
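The mean-free-path estimate and the separation of lengthscales can be reproduced in a few lines. This is an illustrative sketch only: the proton mass is a standard constant supplied here, the density and core size are the round numbers quoted in the text, and the function name is hypothetical.

```python
# Assumed standard constants (cgs); the inputs below are the round numbers from the text.
M_PROTON = 1.6726e-24   # proton rest mass, g
SIGMA_T = 0.665e-24     # total Thomson cross section, cm^2


def photon_mean_free_path(rho):
    """Mean free path 1/(n_e sigma_T), with n_e ~ rho/m_p for hydrogen-dominated gas."""
    n_e = rho / M_PROTON
    return 1.0 / (n_e * SIGMA_T)


rho_core = 1.0               # g/cm^3
scale_cm = 1.0e5 * 1.0e5     # inhomogeneity scale: 1e5 km expressed in cm

lam = photon_mean_free_path(rho_core)
ratio = scale_cm / lam

print(f"mean free path      ~ {lam:.2f} cm")
print(f"scale / mean free path ~ {ratio:.1e}")
```

With these inputs the mean free path comes out between 2 and 3 cm and the ratio of lengthscales is a few times $10^9$, so the second diffusion criterion is satisfied by a very wide margin.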

In the diffusion description the heat flux $\mathbf{q}$ as measured in the gas's rest frame is related 
to the temperature gradient $\nabla T$ by 

$$ \mathbf{q} = -\kappa \nabla T , \tag{2.64} $$

where $\kappa$ is the thermal conductivity. To estimate $\kappa$, orient the coordinates so the temperature 
gradient is in the $z$ direction, and consider the rate of heat exchange between a gas layer 
located near $z = 0$ and a layer one photon-mean-free-path away, at $z = \lambda$ (Fig. 2.8). The heat 
exchange is carried by photons that are emitted from one layer, propagate nearly unimpeded 
to the other, and then scatter. Although the individual scatterings are nearly elastic (and 
we thus are ignoring changes of photon frequency in the Boltzmann equation), tiny changes 
of photon energy add up over many scatterings to keep the photons nearly in local thermal 
equilibrium with the gas. Thus, we shall approximate the photons and gas in the layer at 
$z = 0$ to have a common temperature $T_0$ and those in the layer at $z = \lambda$ to have a common 
temperature $T_\lambda = T_0 + \lambda\, dT/dz$. Then the photons propagating from the layer at $z = 0$ to 
that at $z = \lambda$ carry an energy flux 

$$ q_{0 \to \lambda} = \alpha c a (T_0)^4 , \tag{2.65a} $$

where $a$ is the radiation constant of Eq. (2.40b), $a(T_0)^4$ is the photon energy density at 
$z = 0$, and $\alpha$ is a dimensionless constant of order 1/4 that accounts for what fraction of the 
photons at $z = 0$ are moving rightward rather than leftward, and at what mean angle to 
the $z$ direction. (Throughout this section, by contrast with early sections of this chapter, we 
shall use cgs units with $c$ present explicitly.) Similarly, the flux of energy from the layer at 
$z = \lambda$ to the layer at $z = 0$ is 

$$ q_{\lambda \to 0} = -\alpha c a (T_\lambda)^4 ; \tag{2.65b} $$

and the net rightward flux, the sum of (2.65a) and (2.65b), is 

$$ q = \alpha c a \left[ (T_0)^4 - (T_\lambda)^4 \right] \simeq -4 \alpha c a T^3 \lambda \frac{dT}{dz} . \tag{2.65c} $$

Noting that $4\alpha$ is of order unity, inserting expression (2.63) for the photon mean free path, 
and comparing with the law of diffusive heat flow $\mathbf{q} = -\kappa \nabla T$, we conclude that the thermal 
conductivity is 

$$ \kappa \sim a T^3 c \lambda = \frac{a c T^3}{\sigma_T n_e} . \tag{2.66} $$
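The step from the difference of fourth powers to the temperature gradient in (2.65c) is a first-order Taylor expansion. A brief sketch, assuming only that the temperature changes little over one mean free path ($\lambda\, |dT/dz| \ll T$, which is the diffusion criterion $\lambda \ll \mathcal{L}$ in disguise):

$$ (T_\lambda)^4 = \left( T_0 + \lambda \frac{dT}{dz} \right)^4 \simeq (T_0)^4 + 4 (T_0)^3 \lambda \frac{dT}{dz} \quad\Longrightarrow\quad (T_0)^4 - (T_\lambda)^4 \simeq -4 T^3 \lambda \frac{dT}{dz} . $$

The neglected terms are smaller than the one retained by factors of order $\lambda/\mathcal{L}$, so at this level of accuracy it does not matter whether $T$ on the right side is taken to be $T_0$ or $T_\lambda$.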

2.6.3 Analysis via the Boltzmann Transport Equation 

With these physical insights and rough answer in hand, we turn to a Boltzmann transport 
analysis of the heat transfer. Our first step is to formulate the Boltzmann transport equation 
for the photons (including effects of Thomson scattering off the electrons) in the rest frame 
of the gas. 

The effects of electron scattering are most easily written down in terms of the specific 
intensity $I_\nu$ of the photons, so we initially shall use $I_\nu$ as our distribution function. (We can 
use $I_\nu$ rather than $I_\nu/\nu^3 = (h^4/c^2)\mathcal{N}$ because each photon's frequency $\nu$ is conserved in our 
chosen Lorentz frame as it propagates.) It is easiest to study the scattering as a function of 
the Lorentz coordinate time $t$ the photons have been traveling, or equally well the distance 
$ct$ traveled, so we shall use $t$ as our independent variable. 

Consider those photons that propagate very nearly in the $\mathbf{n}$ direction as seen in the rest 
frame of the gas, and that have frequencies very nearly equal to $\nu$. The specific intensity 
[Eq. (2.8)] describes the energy per unit time per unit area per unit solid angle per unit 
frequency ($d\mathcal{E}/dt\,dA\,d\Omega\,d\nu$; "energy per unit everything") carried by those photons. If 
observers in the rest frame of the gas watch the evolution of $I_\nu$ in the neighborhood of a fiducial 
photon (i.e., moving with that photon at the speed of light in the $\mathbf{n}$ direction), they will 
see $I_\nu$ change as a result of two things: (i) the scattering of photons out of the $\mathbf{n}$ direction 
and into other directions, and (ii) the scattering of photons from other directions $\mathbf{n}'$ into the 
$\mathbf{n}$ direction. Since these scatterings leave the frequency of the scattered photon unchanged 
(a key property of Thomson scattering), we can treat the frequency $\nu$ and the frequency 
interval $d\nu$ as constant in our description of the scattering. Scattering out of and into the 
$\mathbf{n}$ direction are embodied quantitatively in the two terms that contribute to the following 
expression for the rate of change of $I_\nu$: 

$$ \frac{dI_\nu(\mathbf{n},\nu)}{dt} = -\sigma_T n_e c\, I_\nu(\mathbf{n},\nu) + \int \frac{d\sigma(\mathbf{n}' \to \mathbf{n})}{d\Omega}\, n_e c\, I_\nu(\mathbf{n}',\nu)\, d\Omega' . \tag{2.67} $$

Here $t$ is proper time (Lorentz coordinate time) in the gas's rest frame, $n_e$ is the number 
density of electrons in that frame, and all other quantities have previously been defined. 
The first term represents scattering out of the $\mathbf{n}$ direction into other directions; the second, 
scattering into the $\mathbf{n}$ direction from other directions $\mathbf{n}'$. That these terms have the forms 
shown should be obvious from the physical meanings of the scattering cross sections, together 
with the fact that the photons are moving with the speed of light $c$. 

From the relation $\mathcal{N} = {\rm constant} \times I_\nu/\nu^3$ [Eq. (2.9)], and the fact that the photon frequency 
$\nu$ (as measured in the gas rest frame) is a constant along the trajectory of our fiducial photon, 
we infer that the distribution function $\mathcal{N}$ must satisfy the same transport equation (2.67) as 
the specific intensity: 

$$ \frac{d\mathcal{N}(t,\mathbf{x},\mathbf{n},\nu)}{dt} = -\sigma_T n_e c\, \mathcal{N}(t,\mathbf{x},\mathbf{n},\nu) + \int \frac{d\sigma(\mathbf{n}' \to \mathbf{n})}{d\Omega}\, n_e c\, \mathcal{N}(t,\mathbf{x},\mathbf{n}',\nu)\, d\Omega' . \tag{2.68a} $$

Here our notation shows explicitly that $\mathcal{N}$ depends not only on the photon propagation 
direction and frequency (as measured in the mean rest frame of the gas), but also on time $t$ 
and spatial location $\mathbf{x}$. To make the notation fully explicit, we must also note that the time 
derivative $d/dt$ on the left side is taken moving with the fiducial photon, i.e., 

$$ \frac{d}{dt} = \frac{\partial}{\partial t} + c\, n_j \frac{\partial}{\partial x_j} . \tag{2.68b} $$

(There are no contributions $(d\nu/dt)\,\partial/\partial\nu$ or $(dn_j/dt)\,\partial/\partial n_j$ in (2.68b) because photon 
frequency $\nu$ and direction of propagation $n_j$ are constants along the trajectory of a fiducial 
photon in the gas's mean rest frame; i.e., the tiny effects of gravitational redshift and of 
gravitational light deflection are being neglected.) 

Equation (2.68a), with d/dt given by (2.68b), is the Boltzmann transport equation for 
the photons, written using the gas's proper time t as the parameter along the fiducial photon 
trajectory. The terms on the right side of (2.68a) are the explicit realizations, for our star's 
gas, of the Boltzmann equation's collision terms. 

Because the mean free path $\lambda$ is so short compared to the length scale $\mathcal{L}$ of the 
temperature gradient, the heat flow will show up as a tiny correction to an otherwise isotropic, 
perfectly thermal distribution function. Thus, we can write the photon distribution 
function as the sum of an unperturbed, perfectly isotropic and thermalized piece $\mathcal{N}_0$ and a tiny, 
anisotropic perturbation $\mathcal{N}_1$: 

$$ \mathcal{N} = \mathcal{N}_0 + \mathcal{N}_1 , \quad {\rm where} \quad \mathcal{N}_0 = \frac{2}{h^3}\, \frac{1}{e^{h\nu/kT} - 1} . \tag{2.69a} $$

Here the perfectly thermal piece $\mathcal{N}_0$ has the standard black-body form (2.21); it depends 
on the photon 4-momentum only through the frequency $\nu$ as measured in the mean rest 
frame of the gas, and it depends on location in spacetime only through the temperature 
$T$, which we assume is time independent in the star but is a function of spatial location, 
$T = T(\mathbf{x})$. If the photon mean free path were vanishingly small, there would be no way 
for photons at different locations $\mathbf{x}$ to discover that the temperature is inhomogeneous; and, 
correspondingly, $\mathcal{N}_1$ would be vanishingly small. The finiteness of the mean free path permits 
$\mathcal{N}_1$ to be finite, and it is reasonable to expect (and turns out to be true) that the magnitude 
of $\mathcal{N}_1$ is 

$$ \mathcal{N}_1 \sim \frac{\lambda}{\mathcal{L}}\, \mathcal{N}_0 . \tag{2.69b} $$

Thus, $\mathcal{N}_0$ is the leading order term, and $\mathcal{N}_1$ is the first-order correction in an expansion of 
the distribution function $\mathcal{N}$ in powers of $\lambda/\mathcal{L}$. This is called a two-lengthscale expansion; see 
Box 2.2. 

Box 2.2 
Two-Lengthscale Expansions 

Equation (2.69b) is indicative of the mathematical technique that underlies 
Boltzmann-transport computations: a perturbative expansion in the dimensionless ratio 
of two lengthscales, the tiny mean free path $\lambda$ of the transporter particles and the far 
larger macroscopic scale $\mathcal{L}$ of the inhomogeneities that drive the transport. Expansions 
in lengthscale ratios $\lambda/\mathcal{L}$ are called two-lengthscale expansions, and are widely used in 
physics and engineering. Most readers will previously have met such an expansion in 
quantum mechanics: the WKB approximation, where $\lambda$ is the lengthscale on which the 
wave function changes and $\mathcal{L}$ is the scale of changes in the potential $V(x)$ that drives 
the wave function. Kinetic theory itself is the result of a two-lengthscale expansion: It 
follows from the more sophisticated statistical-mechanics formalism in Chap. 3, in the 
limit where the particle sizes are small compared to their mean free paths. In this book 
we shall use two-lengthscale expansions frequently, e.g., in the geometric optics 
approximation to wave propagation (Chap. 6), in the study of boundary layers in fluid mechanics 
(Secs. 13.4 and 14.4), and in the definition of a gravitational wave (Chap. 26). 

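Because the temperature and density inside the star are independent of time $t$, the 
distribution function $\mathcal{N} = \mathcal{N}_0 + \mathcal{N}_1$ is independent of time; and the Boltzmann transport 
equation (2.68), for $\mathcal{N} = \mathcal{N}_0 + \mathcal{N}_1$, takes the form 

$$ c\, n_j \frac{\partial \mathcal{N}_0}{\partial x_j} + c\, n_j \frac{\partial \mathcal{N}_1}{\partial x_j} = \left[ -\sigma_T n_e c\, \mathcal{N}_0 + \int \frac{d\sigma(\mathbf{n}' \to \mathbf{n})}{d\Omega}\, n_e c\, \mathcal{N}_0\, d\Omega' \right] + \left[ -\sigma_T n_e c\, \mathcal{N}_1(\mathbf{n},\nu) + \int \frac{d\sigma(\mathbf{n}' \to \mathbf{n})}{d\Omega}\, n_e c\, \mathcal{N}_1(\mathbf{n}',\nu)\, d\Omega' \right] . \tag{2.69c} $$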



Because $\mathcal{N}_0$ is isotropic, i.e., is independent of photon direction $\mathbf{n}'$, it can be pulled out of 
the integral over $\mathbf{n}'$ in the first square bracket on the right side; and when this is done, the 
first and second terms in that square bracket cancel each other. Thus, the unperturbed part 
of the distribution, $\mathcal{N}_0$, completely drops out of the right side of (2.69c). On the left side 
the term involving the perturbation $\mathcal{N}_1$ is tiny compared to that involving the unperturbed 
distribution $\mathcal{N}_0$, so we shall drop it; and because the spatial dependence of $\mathcal{N}_0$ is entirely due 
to the temperature gradient, we can bring the first term and the whole transport equation 
into the form 

$$ n_j \frac{\partial T}{\partial x_j} \frac{\partial \mathcal{N}_0}{\partial T} = -\sigma_T n_e\, \mathcal{N}_1(\mathbf{n},\nu) + \int \frac{d\sigma(\mathbf{n}' \to \mathbf{n})}{d\Omega}\, n_e\, \mathcal{N}_1(\mathbf{n}',\nu)\, d\Omega' . \tag{2.69d} $$

The left side of this equation is the amount by which the temperature gradient causes $\mathcal{N}_0$ 
to fail to satisfy the Boltzmann equation, and the right side is the manner in which the 
perturbation $\mathcal{N}_1$ steps into the breach and enables the Boltzmann equation to be satisfied. 

Because the left side is linear in the photon propagation direction $n_j$ (i.e., it has a $\cos\theta$ 
dependence in coordinates where $\nabla T$ is in the $z$-direction; i.e., it has a "dipolar", $\ell = 1$ 
angular dependence), $\mathcal{N}_1$ must also be linear in $n_j$, i.e. dipolar, in order to fulfill Eq. (2.69d). 
Thus, we shall write $\mathcal{N}_1$ in the dipolar form 

$$ \mathcal{N}_1 = K_j(\mathbf{x},\nu)\, n_j , \tag{2.69e} $$

and we shall solve the transport equation (2.69d) for the function $K_j$. 

[Important side remark: This is a special case of a general situation: When solving 
the Boltzmann equation in diffusion situations, one is performing a power series expansion 
in $\lambda/\mathcal{L}$; see Box 2.2. The lowest order term in the expansion, $\mathcal{N}_0$, is isotropic, i.e., it is 
monopolar in its dependence on the direction of motion of the diffusing particles. The 
first-order correction, $\mathcal{N}_1$, is down in magnitude by $\lambda/\mathcal{L}$ from $\mathcal{N}_0$ and is dipolar in its dependence 
on the particles' direction of motion. The second-order correction, $\mathcal{N}_2$, is down in magnitude 
by $(\lambda/\mathcal{L})^2$ from $\mathcal{N}_0$ and is quadrupolar in its direction dependence. And so it continues on 
up to higher and higher orders.⁶] 

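When we insert the dipolar expression (2.69e) into the angular integral on the right side 
of the transport equation (2.69d) and notice that the differential scattering cross section 
(2.62a) is unchanged under $\mathbf{n}' \to -\mathbf{n}'$, but $K_j n'_j$ changes sign, we find that the integral 
vanishes. As a result the transport equation (2.69d) takes the simplified form 

$$ n_j \frac{\partial T}{\partial x_j} \frac{\partial \mathcal{N}_0}{\partial T} = -\sigma_T n_e K_j n_j , \tag{2.69f} $$

from which we can read off the function $K_j$ and thence $\mathcal{N}_1 = K_j n_j$: 

$$ \mathcal{N}_1 = -\frac{\partial \mathcal{N}_0/\partial T}{\sigma_T n_e}\, \frac{\partial T}{\partial x_j}\, n_j . \tag{2.69g} $$

Notice that, as claimed above, the perturbation has a magnitude 

$$ \frac{\mathcal{N}_1}{\mathcal{N}_0} \sim \frac{1}{\sigma_T n_e}\, \frac{1}{T}\, |\nabla T| \sim \frac{\lambda}{\mathcal{L}} . \tag{2.69h} $$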
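The symmetry argument that removes the scattering-in integral for a dipolar perturbation can be checked by brute-force quadrature over the sphere. The sketch below is illustrative only: it works with the Thomson kernel of (2.62a) divided by $\sigma_T$, the direction $\mathbf{n}$ and the vector $K_j$ are arbitrary test values, and all function names are hypothetical.

```python
import math


def dot(u, v):
    """Euclidean dot product of two 3-vectors given as tuples."""
    return sum(a * b for a, b in zip(u, v))


def thomson_kernel(n, n_prime):
    """(1/sigma_T) * dsigma(n' -> n)/dOmega for unit vectors n and n_prime."""
    mu = dot(n, n_prime)
    return 3.0 / (16.0 * math.pi) * (1.0 + mu * mu)


def scatter_in_integral(n, angular_function, n_theta=200, n_phi=400):
    """Midpoint-rule estimate of the integral over n' of kernel(n, n') * f(n')."""
    dtheta = math.pi / n_theta
    dphi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * dtheta
        sin_t, cos_t = math.sin(theta), math.cos(theta)
        for j in range(n_phi):
            phi = (j + 0.5) * dphi
            n_prime = (sin_t * math.cos(phi), sin_t * math.sin(phi), cos_t)
            weight = sin_t * dtheta * dphi          # solid-angle element
            total += thomson_kernel(n, n_prime) * angular_function(n_prime) * weight
    return total


# Arbitrary test direction (unit vector) and arbitrary dipole vector K_j.
n = (0.6, 0.0, 0.8)
K = (0.3, -1.2, 2.0)

monopole = scatter_in_integral(n, lambda v: 1.0)       # isotropic piece
dipole = scatter_in_integral(n, lambda v: dot(K, v))   # dipolar piece K_j n'_j

print(f"monopole integral = {monopole:.6f}")
print(f"dipole integral   = {dipole:.3e}")
```

The monopole integral comes out close to one, which is the statement that an isotropic $\mathcal{N}_0$ scatters into the $\mathbf{n}$ direction at exactly the rate it scatters out. The dipole integral vanishes to within roundoff, which is why only the scattering-out term survives in (2.69f).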



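⁶ For full details in nonrelativistic situations see, e.g., Grad (1957); and for full details in the fully 
relativistic case see, e.g., Thorne (1981). 

We can now evaluate the energy flux carried by the diffusing photons. That flux $q_i$ is 
the $T^{0i}$ part of the stress-energy tensor. By inserting $\mathcal{N} = \mathcal{N}_0 + \mathcal{N}_1$ into the 
momentum-space integral (2.30c) for the stress-energy tensor, noting that because $\mathcal{N}_0$ is isotropic its 
contribution to the integral vanishes, and using expression (2.69g) for $\mathcal{N}_1$, we obtain 

$$ q_i = T^{0i} = c \int \mathcal{N}_1\, p^0 p^i\, \frac{d\mathcal{V}_p}{p^0} = -\frac{c}{\sigma_T n_e}\, \frac{\partial T}{\partial x_j} \int \frac{\partial \mathcal{N}_0}{\partial T}\, n_j\, p^0 p^i\, \frac{d\mathcal{V}_p}{p^0} . \tag{2.70a} $$

By then noting that for photons $n_j p^0 = p^j$, pulling the derivative with respect to 
temperature out from under the integral sign, noting that the integral then becomes that for 
the photons' stress tensor $T^{ji} = P\delta^{ji}$, and using $P = \frac{1}{3} a T^4$ [Eq. (2.40a)] for the photons' 
pressure $P$, we obtain 

$$ q_i = -\frac{c}{\sigma_T n_e}\, \frac{\partial T}{\partial x_j}\, \frac{\partial}{\partial T} \int \mathcal{N}_0\, p^j p^i\, \frac{d\mathcal{V}_p}{p^0} = -\frac{c}{\sigma_T n_e}\, \frac{\partial T}{\partial x_j}\, \frac{\partial}{\partial T} \left( \frac{1}{3} a T^4 \delta_{ji} \right) = -\frac{c}{\sigma_T n_e}\, \frac{4}{3}\, a T^3\, \frac{\partial T}{\partial x_i} . \tag{2.70b} $$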

Thus, from the Boltzmann transport equation we have simultaneously derived the law of 
diffusive heat conduction $\mathbf{q} = -\kappa \nabla T$ and evaluated the coefficient of heat conductivity 

$$ \kappa = \frac{4}{3}\, \frac{a c T^3}{\sigma_T n_e} . \tag{2.71} $$

Notice that this heat conductivity is 4/3 times our crude, order-of-magnitude estimate (2.66). 

The above calculation, while somewhat complicated in its details, is conceptually fairly 
simple. The reader is encouraged to go back through the calculation and identify the main 
conceptual steps (expansion of distribution function in powers of $\lambda/\mathcal{L}$, insertion of 
zero-order plus first-order parts into the Boltzmann equation, multipolar decomposition of the 
zero and first-order parts with zero-order being monopolar and first-order being dipolar, 
neglect of terms in the Boltzmann equation that are smaller than the leading ones by factors 
$\lambda/\mathcal{L}$, solution for the coefficient of the multipolar decomposition of the first-order part, 
reconstruction of the first-order part from that coefficient, and insertion into a 
momentum-space integral to get the flux of the quantity being transported). Precisely these same steps 
are used to evaluate all other transport coefficients that are governed by classical physics. For 
examples of other such calculations see, e.g., Shkarofsky, Johnston, and Bachynski (1966). 

As an application of the thermal conductivity (2.71), consider a young (main-sequence) 7 
solar mass ($7 M_\odot$) star as modeled, e.g., on page 480 of Clayton (1968). Just outside the star's 
convective core, at radius $r \simeq 0.8 R_\odot \simeq 6 \times 10^5$ km (where $R_\odot$ is the Sun's radius), the density 
and temperature are $\rho \simeq 5\ {\rm g/cm^3}$ and $T \simeq 1.6 \times 10^7$ K, so the number density of electrons is 
$n_e \simeq \rho/m_p \simeq 3 \times 10^{24}\ {\rm cm^{-3}}$. For these parameters, Eq. (2.71) gives a thermal conductivity 
$\kappa \simeq 7 \times 10^{17}\ {\rm erg\ s^{-1}\ cm^{-1}\ K^{-1}}$. The lengthscale $\mathcal{L}$ on which the temperature is changing 
is approximately the same as the radius, so the temperature gradient is $|\nabla T| \sim T/r \sim 
3 \times 10^{-4}$ K/cm. The law of diffusive heat transfer then predicts a heat flux $q = \kappa |\nabla T| \sim 
2 \times 10^{14}\ {\rm erg\ s^{-1}\ cm^{-2}}$, and thus a total luminosity $L = 4\pi r^2 q \sim 8 \times 10^{36}$ erg/s $\simeq 2000 L_\odot$ (2000 
solar luminosities). What a difference the mass makes! The heavier a star, the hotter its 
core, the faster it burns, and the higher its luminosity. Increasing the mass by a factor of 7 
drives the luminosity up by a factor of 2000. 
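The arithmetic of this application is easy to reproduce. The following sketch evaluates Eq. (2.71) and the resulting flux and luminosity; the physical constants (radiation constant, speed of light, proton mass, solar luminosity) are standard cgs values supplied here as assumptions, the stellar parameters are the ones quoted in the text, and the function name is illustrative.

```python
import math

# Standard cgs constants, supplied as assumptions.
A_RAD = 7.566e-15     # radiation constant a, erg cm^-3 K^-4
C_LIGHT = 2.998e10    # speed of light, cm/s
SIGMA_T = 0.665e-24   # Thomson cross section, cm^2
M_PROTON = 1.673e-24  # proton mass, g
L_SUN = 3.83e33       # solar luminosity, erg/s


def thermal_conductivity(temperature, n_e):
    """Photon-diffusion conductivity of Eq. (2.71): (4/3) a c T^3 / (sigma_T n_e)."""
    return (4.0 / 3.0) * A_RAD * C_LIGHT * temperature**3 / (SIGMA_T * n_e)


# Parameters quoted in the text for the 7-solar-mass model.
rho = 5.0        # g/cm^3
T = 1.6e7        # K
r = 6.0e10       # cm (6e5 km)

n_e = rho / M_PROTON                 # one electron per nucleon
kappa = thermal_conductivity(T, n_e)
grad_T = T / r                       # |grad T| ~ T / r
q = kappa * grad_T                   # heat flux magnitude
L = 4.0 * math.pi * r**2 * q         # total luminosity

print(f"n_e    ~ {n_e:.1e} cm^-3")
print(f"kappa  ~ {kappa:.1e} erg s^-1 cm^-1 K^-1")
print(f"grad T ~ {grad_T:.1e} K/cm")
print(f"q      ~ {q:.1e} erg s^-1 cm^-2")
print(f"L      ~ {L:.1e} erg/s ~ {L / L_SUN:.0f} L_sun")
```

With these inputs the conductivity, flux, and luminosity agree with the rounded values in the text to within the accuracy expected of an estimate that sets $|\nabla T| \sim T/r$.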



EXERCISES 



Exercise 2.13 Example: Diffusion Coefficient Computed in the "Collision-Time" 
Approximation 

Consider a collection of identical "test particles" with rest mass $m \neq 0$ that diffuse through a 
collection of thermalized "scattering centers". (The test particles might be molecules of one 
species, and the scattering centers might be molecules of a much more numerous species.) 
The scattering centers have a temperature $T$ such that $kT \ll mc^2$, so if the test particles 
acquire this same temperature they will have thermal speeds small compared to the speed 
of light, as measured in the mean rest frame of the scattering centers. We shall study the 
effects of scattering on the test particles using the following "collision-time" approximation 
for the collision terms in the Boltzmann equation: 

$$ \left( \frac{d\mathcal{N}}{dt} \right)_{\rm collision} = \frac{1}{\tau} \left( \mathcal{N}_0 - \mathcal{N} \right) , \quad {\rm where} \quad \mathcal{N}_0 \equiv n\, \frac{e^{-p^2/2mkT}}{(2\pi m k T)^{3/2}} . \tag{2.72a} $$

All frame-dependent quantities appearing here are evaluated in the mean rest frame of the 
scattering centers; in particular, $t$ is Lorentz time, $T$ is the temperature of the scattering 
centers, $p = |\mathbf{p}|$ is the magnitude of the test particles' spatial momentum, $n$ is the number 
density of test particles 

$$ n = \int \mathcal{N}\, dp_x\, dp_y\, dp_z , \tag{2.72b} $$

and $\tau$ is a constant to be discussed below. 



(a) Show that this collision term preserves test particles in the sense that 

$$ \left( \frac{dn}{dt} \right)_{\rm collision} \equiv \int \left( \frac{d\mathcal{N}}{dt} \right)_{\rm collision} dp_x\, dp_y\, dp_z = 0 . \tag{2.73} $$

(b) Explain why this collision term corresponds to the following physical picture: Each 
test particle has a probability per unit time $1/\tau$ of scattering; and when it scatters, 
its direction of motion is randomized and its energy is thermalized at the scattering 
centers' temperature. 

(c) Suppose that the temperature $T$ is homogeneous (spatially constant), but that the test 
particles are distributed inhomogeneously, $n = n(\mathbf{x}) \neq$ const. Let $\mathcal{L}$ be the length scale 
on which their number density $n$ varies. What condition must $\mathcal{L}$, $\tau$, $T$, and $m$ satisfy 
in order that the diffusion approximation be reasonably accurate? Assume that this 
condition is satisfied. 

(d) The inhomogeneity of $n$ induces a net flux of particles $\mathbf{S}$, from high-density regions 
toward low-density regions, given by 

$$ \mathbf{S} = -D \nabla n . \tag{2.74a} $$

This is the analog of Ohm's law $\mathbf{J} = \sigma \mathbf{E}$ for the flow of electric charge and of $\mathbf{q} = -\kappa \nabla T$ 
for the flow of heat, and $D$ is the diffusion coefficient. Give an order-of-magnitude 
derivation of the diffusion coefficient $D$. 

(e) Show that the law of particle conservation reduces, in this situation, to the following 
diffusion equation for $n(t, \mathbf{x})$: 

$$ \frac{\partial n}{\partial t} + \nabla \cdot (-D \nabla n) = 0 . \tag{2.74b} $$

(f) Show that for this nonrelativistic problem the Boltzmann transport equation takes the 
form 

$$ \frac{\partial \mathcal{N}}{\partial t} + \frac{p_j}{m}\, \frac{\partial \mathcal{N}}{\partial x_j} = \frac{1}{\tau} \left( \mathcal{N}_0 - \mathcal{N} \right) . \tag{2.75a} $$

(g) Show that to first order in a small diffusion-approximation parameter, the solution of 
this equation is $\mathcal{N} = \mathcal{N}_0 + \mathcal{N}_1$, where $\mathcal{N}_0$ is as defined in Eq. (2.72a) above, and 

$$ \mathcal{N}_1 = -\frac{p_j \tau}{m}\, \frac{\partial n}{\partial x_j}\, \frac{e^{-p^2/2mkT}}{(2\pi m k T)^{3/2}} . \tag{2.75b} $$

Note that $\mathcal{N}_0$ is monopolar (independent of the direction of $\mathbf{p}$), while $\mathcal{N}_1$ is dipolar 
(linear in $\mathbf{p}$). 

(h) Show that the perturbation $\mathcal{N}_1$ gives rise to a particle flux given by Eq. (2.74a), with 
the diffusion coefficient 

$$ D = \frac{kT}{m}\, \tau . \tag{2.76} $$


Exercise 2.14 Example: Neutron Diffusion in a Nuclear Reactor 

Here are some salient, oversimplified facts about nuclear reactors (see, e.g., Stephenson 1954, 
especially Chap. 4): A reactor's core is made of a mixture of natural uranium (0.72 percent 
$^{235}$U and 99.28 percent $^{238}$U) and a solid or liquid material such as carbon (graphite) or 
water, made of low-atomic-number atoms and called the moderator. For concreteness we 
shall assume that the moderator is graphite. Slow (thermalized) neutrons, with kinetic 
energies of $\sim 0.1$ eV, get captured by the $^{235}$U nuclei and trigger them to fission, releasing 
$\sim 170$ MeV of kinetic energy per fission (which ultimately goes into heat and then electric 
power), and also releasing an average of about 2 fast neutrons (kinetic energies $\sim 1$ MeV). 
The fast neutrons must be slowed to thermal speeds in order for them to capture onto $^{235}$U 
atoms and induce further fissions. The slowing is achieved by scattering off the moderator 
atoms. The scattering is elastic and is isotropic in the center-of-mass frame with total cross 
section $\sigma_s \simeq 4.8 \times 10^{-24}\ {\rm cm}^2 = 4.8$ barns. Using the fact that the incoming neutron has a much 
higher kinetic energy than the moderator carbon atoms, and using energy and momentum 
conservation and the isotropy of the scattering, one can show that the logarithm of the 
neutron's energy is reduced in each scattering by an average amount $\xi$ that is independent 
of energy and is given by: 

$$ \xi \equiv -\Delta \ln E = 1 + \frac{(A-1)^2}{2A}\, \ln\left( \frac{A-1}{A+1} \right) \simeq 0.158 . \tag{2.77} $$

Here $A \simeq 12$ is the ratio of the mass of the scattering atom to the mass of the scattered 
neutron. 
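The numerical value in Eq. (2.77) can be confirmed directly. The sketch below evaluates the formula for carbon and, as an illustrative extra that is an inference rather than a statement from the text, estimates how many scatterings are needed to slow a neutron from about 1 MeV to about 0.1 eV if each scattering lowers $\ln E$ by $\xi$ on average. The function name is hypothetical.

```python
import math


def mean_log_energy_decrement(mass_ratio):
    """Average decrease of ln(E) per elastic, isotropic scattering, Eq. (2.77).

    mass_ratio is A = (scatterer mass)/(neutron mass); the formula as written
    requires A > 1.
    """
    a = float(mass_ratio)
    return 1.0 + (a - 1.0) ** 2 / (2.0 * a) * math.log((a - 1.0) / (a + 1.0))


xi = mean_log_energy_decrement(12.0)

# Rough count of scatterings from ~1 MeV down to ~0.1 eV (energies quoted in the text).
E_FAST_EV = 1.0e6
E_SLOW_EV = 0.1
n_scatterings = math.log(E_FAST_EV / E_SLOW_EV) / xi

print(f"xi for carbon (A = 12) = {xi:.4f}")
print(f"scatterings needed     ~ {n_scatterings:.0f}")
```

For $A = 12$ this reproduces $\xi \simeq 0.158$, and the slowdown takes of order a hundred scatterings.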

There is a dangerous hurdle that the diffusing neutrons must overcome during their 
slowdown: as the neutrons pass through a critical energy region of about 7 to 6 eV, the 
$^{238}$U atoms can absorb them. The absorption cross section has a huge resonance there, 
with width about 1 eV and resonance integral $\int \sigma_a\, d\ln E \simeq 240$ barns. For simplicity, we 
shall approximate the cross section in this absorption resonance by $\sigma_a \simeq 1600$ barns at 
$6\ {\rm eV} < E < 7\ {\rm eV}$, and zero outside this range. To achieve a viable fission chain reaction 
and keep the reactor hot, it is necessary that about half of the neutrons (one per original 
$^{235}$U fission) slow down through this resonant energy without getting absorbed. Those that 
make it through will thermalize and trigger new $^{235}$U fissions (about one per original fission), 
maintaining the chain reaction. 

We shall idealize the uranium and moderator atoms as homogeneously mixed on 
lengthscales small compared to the neutron mean free path, $\lambda_s = 1/(\sigma_s n_s) \simeq 2.5$ cm, where $n_s$ is 
the number density of moderator (carbon) atoms. Then the neutrons' distribution function 
$\mathcal{N}$, as they slow down, will be isotropic in direction and independent of position; and in our 
steady state situation, it will be independent of time. It therefore will depend only on the 
neutron kinetic energy $E$, $\mathcal{N} = \mathcal{N}(E)$. Use the Boltzmann transport equation or other 
considerations to develop the theory of the slowing down of the neutrons, and of their struggle 
to pass through the $^{238}$U resonance region without getting absorbed. More specifically: 

(a) Use as the distribution function $n_E(E) \equiv dN/d^3x\,dE$ = (number of neutrons per unit 
volume and per unit kinetic energy), and denote by $q(E)$ the number of neutrons per 
unit volume that slow down through energy $E$ per unit time. Show that outside the 
resonant absorption region these two quantities are related by 

$$ q = \sigma_s n_s\, \xi\, E\, n_E\, v , \quad {\rm where} \quad v = \sqrt{2E/m} \tag{2.78} $$

is the neutron speed, so $q$ contains the same information as the distribution function 
$n_E$. Explain why the steady-state operation of the nuclear reactor requires $q$ to be 
independent of energy in this non-absorption region, and infer that $n_E \propto E^{-3/2}$. 

(b) Show further that inside the resonant absorption region, $6\ {\rm eV} < E < 7\ {\rm eV}$, the 
relationship between $q$ and $n_E$ is modified: 

$$ q = (\sigma_s n_s + \sigma_a n_a)\, \xi\, E\, n_E\, v . \tag{2.79} $$

Here $n_a$ is the number density of absorbing ($^{238}$U) atoms and $n_s$ is the number density 
of scattering (carbon) atoms. [Hint: require that the rate at which neutrons scatter 
into a tiny interval of energy $\delta E \ll \xi E$ is equal to the rate at which they leave that 
tiny interval.] Then show that the absorption causes $q$ to vary with energy according 
to the following differential equation: 



(c) By solving this differential equation in our idealization of constant $\sigma_a$ over the range 6 
to 7 eV, show that the condition to maintain the chain reaction is 



Thus, to maintain the reaction in the presence of the huge $^{238}$U absorption resonance 
for neutrons, it is necessary that more than 99 per cent of the reactor volume 
be taken up by moderator atoms and less than 1 per cent by uranium atoms. 
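The size of the uranium fraction allowed by the resonance can be estimated numerically. The sketch below is one possible reading of parts (b) and (c), not the text's own formula: it assumes that absorption removes neutrons at a rate $\sigma_a n_a n_E v$ per unit volume and energy, which together with (2.79) gives $dq/d\ln E = \sigma_a n_a\, q / [(\sigma_s n_s + \sigma_a n_a)\xi]$, and it assumes that "about half" means a survival fraction of exactly 1/2. Function names are hypothetical; cross sections and $\xi$ are the values quoted in the exercise.

```python
import math

# Values quoted in the exercise.
SIGMA_S = 4.8       # scattering cross section of carbon, barns
SIGMA_A = 1600.0    # idealized 238U absorption cross section in the resonance, barns
XI = 0.158          # mean logarithmic energy decrement per scattering
E_LOW_EV, E_HIGH_EV = 6.0, 7.0   # edges of the idealized resonance


def survival_fraction(uranium_to_carbon):
    """Fraction of neutrons that slow from 7 eV to 6 eV without being absorbed.

    ASSUMPTION: dq/dlnE = x q / ((1 + x) xi), with x = sigma_a n_a / (sigma_s n_s).
    """
    x = SIGMA_A * uranium_to_carbon / SIGMA_S
    exponent = x / ((1.0 + x) * XI) * math.log(E_HIGH_EV / E_LOW_EV)
    return math.exp(-exponent)


def max_uranium_to_carbon(target_survival=0.5):
    """Largest n_a/n_s for which the survival fraction stays above the target."""
    y = XI * math.log(1.0 / target_survival) / math.log(E_HIGH_EV / E_LOW_EV)
    # y is the largest allowed value of x/(1+x); y >= 1 would mean no constraint.
    x_max = y / (1.0 - y)
    return x_max * SIGMA_S / SIGMA_A


resonance_integral = SIGMA_A * math.log(E_HIGH_EV / E_LOW_EV)
ratio_max = max_uranium_to_carbon()

print(f"idealized resonance integral ~ {resonance_integral:.0f} barns")
print(f"maximum n_a / n_s            ~ {ratio_max:.4f}")
print(f"survival at that ratio       = {survival_fraction(ratio_max):.3f}")
```

Under these assumptions the idealized resonance integral is close to the quoted 240 barns, and the allowed ratio of uranium to carbon atoms comes out somewhat below one per cent, in line with the conclusion stated above.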

We reiterate that this is a rather idealized version of what happens inside a nuclear reactor, 
but it provides insight into some of the important processes and the magnitudes of various 
relevant quantities. For a graphic example of an additional complexity, see the description 
of "Xenon poisoning" of the chain reaction in the first production-scale nuclear reactor 
(built during World War II to make plutonium for the first American atomic bombs), in 
John Archibald Wheeler's autobiography (Wheeler 1998). 